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R A CKOROUND HF THE INVENTION 



This application claims priority to and specifically incorporates by reference, the content 
of U.S. Provisional Application Serial No. 60/100,403 filed September 15, 1998 and U.S. 
5 Provisional Application Serial No. 60/102,392 filed September 29, 1998. The entire text of each 
of the above-referenced disclosures is specifically incorporated by reference herein without 
disclaimer. 

1. Field of the Invention 

10 The present invention relates generally to the fields of microbiology and immunology. 

More particularly, it concerns the response of macrophages to endotoxin exposure from Gram- 
negative bacteria and methods of detecting and treating individuals at high risk for infection by 
1 Gram-negative bacteria and inhibiting sepsis and septic shock. 

m 15 2. Description of Related Art 

Jl "Innate" or "natural" immunity is largely subserved by macrophages and NK cells. These 

S cells lack the huge repertoire of pathogen receptors that comprise the afferent limb of "specific" 
O immunity, e. g , the receptors found on T-cells and B-cells. Rather, they rely upon very broadly 
fi effective mechanisms for the recognition of invasive organisms. An important case in point 
S 20 concerns Gram-negative bacteria, all of which bear endotoxin (lipopolysaccharide, LPS) 
% molecules in their outer membrane, which trigger a strong immune response on the part of the 

host which produces a shock-like syndrome, characterized by low blood pressure and 

hyporeactivity to vasoconstricter agents. 

25 When macrophages are exposed to pure preparations of endotoxin, they secrete 

numerous cytokine mediators, including tumor necrosis factor (TNF, TNFa), interleukin-1 (IL- 
1), interferon-a/p, GM/CSF, IL-8 and ultimately smaller "autocoid" molecules, all of which 
mediate an intense inflammatory reaction. Endotoxin recognition acts as an early warning signal 
through which a host may mount a timely defense against invasion by Gram-negative organisms. 
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However, widespread — of macrophages * endotoxin — - * « * 
septic shoe, By most estimates, Gram-negative septic shock is responsible for 100,0* d«*s 
pe year in the United States alone. The end* syndrome of hypotension, coagulofa, y, 
ZL, edema and acute - failure results, * part, * release ™F and other 
cytokines in response to exposure to endotoxin. 

TNF is probably the most important mediator of local inflammatory processes as well as 
septic endotoxin shoe, 1. ,s produced quickly and in farge amounts b, macrons that 
en ounter endotoxin ,B=ut.er - at, ,985a). TNF causes shock when administered ,0 animals 
, CTracey • a,., M and blockade of TNF synthesis or activity markedly attenuates *e letal 
effect of endotoxin (BeuUer „ a,., ,985b). The relarionship between endotoxm and TNF 

■ i i.„„.v.r little is known about the mechanism by 
therefore directly correlated. Surpnsmgly, however, little ts known 

which endotoxin triggers host cells to release TNF. 

The cognate receptor for endotoxin, complexed with lipopolysaceharide binding protein 
in Pl asma, ,s the GPHinked ceil surface membra protein 0>14. CD14 ««« 
endotoxin onto the surface of macrophages and other defensive cells of the hos 

(Wright e,ai, 1990). However, it dees no, actually signal the presence of endotoxm, 

has no cytoplasmic component with which to do so. 

Endotoxin is known to trigger both tyrosine and serine phosphorylation events within the 
macrophage ce„, and a, .east in part, », raf, MEK, and members o, the MAP kinase fan* are 
involved ,n signal transducuon (Gepper, - a, 1994). The endpoints of endotoxm signahn 
include activation of the transcription of TNF and various genes, and activauon of the u-at, 
25 of TNF mRNA (Bender e, .986= Han « a,., 1990, At the protein level, this stimulation h, 
endotoxin leads to a severa, thousand-fold augmentation of cytokine biosynthes* y a 
m acrophage cell. Bu, the initial controlling element and even, in the signaling pathway 
macro hage response ,o endotoxin has no, been iden„ M . Thus, m spi,e of i,s importance, 
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Toll-like receptor 2 (TLR2) has been suggested to partially mediate lipopolysaccharide-induced 
cellular signaling (Gerard, 1998; Yang etai, 1998). 

Thirty years ago, mice of the C3H/HeJ strain were noted to be specifically and globally 
5 unresponsive to endotoxin, while closely related animals of the C3H/HeN or C3H/OuJ 
substrains exhibited normal responses (Sultzer, 1968). The median lethal dose of endotoxin is 
more than 100-fold higher in C3H/HeJ mice than in either of these other strains. Macrophages 
of C3H/HeJ mice fail to produce cytokines in response to endotoxin, and B-lymphocytes of 
C3H/HeJ mice are not driven to proliferate by endotoxin. While C3H/HeJ mice are highly 
10 resistant to the lethal effect of endotoxin, they are unusually sensitive to infection by 
gram-negative organisms. The mean lethal inoculum with Salmonella typhimurium, for 
example, is two organisms in C3H/HeJ mice, whereas several thousand organisms are required 
to kill mice of the C3H/HeN strain. Hence, the ability to sense the presence of endotoxin is 
% required for defense against gram-negative organisms and it is speculated that individuals that 
!J1 15 suffer from sepsis and septic shock have a similar genetic mutation which causes them to be 
m more susceptible to infection. 

□ These defective responses by the C3H/HeJ mice are the result of a single, codominant 

y, mutation, which maps to a position between the widely separated Mup-1 (Major urinary protein) 
;1; 20 and Ps (polysyndactyly) loci on mouse chromosome IV (Watson et ai, 1978). Mice 
'S homozygous for the mutant allele of the "Lps gene" are unresponsive to endotoxin, whereas 
homozygotes for the common allele are normally responsive, whether lethality or cell-based 
assays are employed as an index. Heterozygotes exhibit intermediate levels of response. The 
protein encoded by this mutant gene is the most important known determinant of 
25 endotoxin-induced TNF biosynthesis, and indeed, of all reactions to endotoxin. 

Many attempts to identify the product of the Lps gene, or to clone it have been made. 
With the recognition that CD14 serves as the principle cell-surface receptor for endotoxin, it was 
proposed that the Lps gene might encode an associated polypeptide chain with signal 
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transducing potential, or more broadly, an early component of the signal transduction apparatus. 
Attempts to identify a CD 14 binding molecule, which might be the product of the Lps gene, 
have been pursued by several investigators. Two-hybrid screening, affinity chromatography, and 
cross linking approaches have thus far each failed to pinpoint a protein that specifically engages 
5 CD 14. Expression cloning strategies have also been applied in the search for the Lps gene 
product without success. 



Given the occurrence of gram-negative bacteremia and the high and rising incidence of 
gram-negative nosocomial infections, a certain subset of the population appears to be at high 
10 risk to develop endotoxic shock even if adequate antimicrobial therapy is instituted. It would be 
useful to know which patients are at high risk for gram-negative bacterial infections and sepsis 
in advance of its onset. Diagnostic methods that predict the risk of infection as well as the 
clinical course of sepsis could be reasonably applied to most hospitalized patients. In addition, it 
is clear that there is an immediate and increasing need for new drugs and treatment methods that 

CP 15 regulate macrophage response to gram-negative bacterial infections. Unfortunately, given the 

-a 

m current lack of understanding of the regulation of the macrophage response to endotoxin, these 
?=?; drugs and methods have not been developed, and patients continue to be at risk for these life- 
threatening infections. 
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*Jl 20 SUMMARY OF THE INVENTION 



The present invention relates in part to methods for screening for susceptibility to 
infection. These methods are based on the Inventors' discovery that the Toll-4 or TLR-4 
polypeptide plays a role as the LPS receptor and is, therefore, involved in the pathway leading to 
25 immune responses in response to certain infections, including especially, certain infection 
involving Gram negative bacteria. In particular embodiments, the invention provides screening 
methods for identifying individuals at risk for certain infections. Testing positive for such 
screens would permit proactive counseling and/or treatment of susceptible individuals. For 
example, in particular circumstances an individual may be taking immunosuppressive drugs or 
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be immunodeficient. It would be advantageous in this, and in other instances, for the individual 
susceptible to infection, to be apprised of the risk to Gram negative bacterial infection. Recently 
the nomenclature for the Toll-4 protein has been changed to TLR-4 (Toll-like receptor 4). Thus, 
in the context of the present invention it is important to note that Toll-4 and TLR-4 are used 
5 interchangeably. The new nomenclature will be used herein, unless such designation leads to 
ambiguity in certain textual embodiments. 

Such methods for screening for the susceptibility to infection generally comprise: 
obtaining sample nucleic acid from an animal; and analyzing the sample nucleic acid to detect a 
mutation in a gene encoding a TLR-4 polypeptide relative to a sequence of a gene encoding a 
native TLR-4 polypeptide; wherein a mutation in the gene encoding the TLR-4 polypeptide is 
indicative of susceptibility to infection. In most cases, the nucleic acid analyzed is DNA, and 
the step of analyzing the TLR-4-encoding nucleic acid comprises sequencing the TLR-4- 
encoding nucleic acid to obtain a sequence. In order to determine whether a mutation exists in 
the obtained sequence of the TLR-4-encoding nucleic acid, the sequence may be compared to a 
native nucleic acid sequence of TLR-4. For example, the native nucleic acid sequence of TLR-4 
may have a sequence set forth in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:46, 
SEQ ID NO:47 or SEQ ID NO:48. In another example, the native TLR-4 polypeptide may have 
an amino acid sequence set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO: 
98 or SEQ ID NO:99. Of course, nay methods of determining whether a mutation is present in 
the gene encoding the TLR-4 polypeptide is within the scope of the invention. 

In cases where the TLR-4-encoding nucleic acid comprises a mutation, that mutation 
may be a point mutation, or any other type of mutation. The step of analyzing the TLR-4- 
25 encoding nucleic acid may comprise PCR, an RNase protection assay, or an RFLP procedure. 
Alternatively, an antibody that discriminates wild-type TLR-4 from mutant TLR-4 nucleic acid 
or TLR-4 polypeptide may be used in an immunodetection format or the TLR-4 polypeptide may 
be directly sequenced. 

-6- 




A:225917(4%BH01!.DOC) 



It is contemplated that the mutation in the TLR-4 encoding nucleic acid may arise 
through deletion mutations, insertion mutations, frameshift mutations, nonsense mutations, 
missense mutations or splice mutations. In particularly preferred embodiments, the sample 
nucleic aid comprises a mutation that results in a change from PRO to HIS at residue 712 of a 
5 TLR-4 protein of SEQ ID NO:99. In other preferred embodiments, the sample nucleic acid 
comprises a mutation that results in a change from GLU to LYS at residue 178, a change from 
ARG to HIS at residue 763, a change from GLN to ARG at residue 188, a change from ASP to 
GLY at residue 299, a change from ASN to SER at residue 329, a change from GLU to LYS at 
residue 474, a change from ARG to HIS at residue 763, a change from TYR to CYS at residue 
10 46, a change from PRO to HIS at residue 145, a change from CYS to TYR at residue 281, a 
change from ASN to HIS at residue 624, or a change from THR to ILE at residue 399 of the 
native TLR-4 polypeptide amino acid sequence of SEQ ID NO:98. In yet other embodiments, 
sg the sample nucleic acid comprises at least a second mutation, wherein the second mutation 
|~ results in a deletion of VAL-GLY-THR at residues 827-829 of the native TLR-4 polypeptide 
\P: 15 amino acid sequence of SEQIDNO:98. 

~ In particular embodiments of the invention, the sample nucleic acid comprises at least 

O one point mutation relative to a nucleic acid sequence from a gene encoding a native TLR-4 
y* polypeptide, wherein the mutation is in nucleotide 2342 of the nucleic acid sequence of SEQ ID 
|~ 20 NO:46. In further embodiments, the sample nucleic acid comprises at least two point mutations 
*0 relative to a nucleic acid sequence from a gene encoding a native TLR-4 polypeptide, wherein at 

least one mutation is a change from nucleotide C to nucleotide A at position 2342 of the nucleic 

acid sequence of SEQ ID NO:46. 

25 In other cases, at least one mutation is in Exon 2, Exon 3 or Intron 2 of the sequence of 

SEQ ID NO:47, wherein said at least one mutation is a change from nucleotide A to nucleotide 
G at position 8457, a change from nucleotide G to nucleotide A at position 8612, a change from 
nucleotide A to nucleotide G at position 8631, a change from nucleotide A to nucleotide G at 
position 12245, a change from nucleotide T to nucleotide C at position 12293, a change from 
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nucleotide C to nucleotide A at position 12412, a change from nucleotide C to nucleotide A at 
position 12413, a change from nucleotide A to nucleotide G at position 12541, a change from 
nucleotide G to nucleotide A at position 12820, a change from nucleotide A to nucleotide G at 
position 12874, a change from nucleotide A to nucleotide G at position 12964, a change from 
5 nucleotide C to nucleotide T at position 13174, a change from nucleotide G to nucleotide A at 
position 13398, a change from nucleotide G to nucleotide A at position 13769, a change from 
nucleotide A to nucleotide C at position 13848, a change from nucleotide G to nucleotide A at 
position 13937, or a change from nucleotide G to nucleotide A at position 114266 of the 
sequence of SEQ ID NO:47. In another embodiment, at least one mutation is a deletion of 
10 nucleotide T at position 12228 of the sequence of SEQ ID NO:47. In a preferred embodiment, at 
least one mutation is a change from nucleotide A to nucleotide G at position 12245 of the gene 
sequence and a deletion of nucleotides 14453 to 14461 of the sequence of SEQ ID NO:47. In 
jSJ other embodiments, the sample nucleic acid sequence comprises at least two mutations relative 
^ to the sequence of SEQ ID NO:47, wherein at least two mutations comprise a change from 
Ifi 15 nucleotide C to nucleotide T at position 12399 and a change from nucleotide G to nucleotide A 
at position 12510, a change from nucleotide C to nucleotide A at position 12413 and a change 
from nucleotide G to nucleotide A at position 14266, or a change from nucleotide A to 
nucleotide G at position 12874 and a change from nucleotide C to nucleotide T at position 
f 2 13 174 of the sequence of SEQ ID NO:47. 



^ 20 



In other embodiments, the present invention relates to methods of reducing susceptibility 
of an animal to infection comprising the step of modulating an LPS mediated response in the 
animal. In most animals, a mutation or other defect can cause the animal to be unable to mount 
an appropriate response in the presence of an infectious agent, for example, a gram negative 
25 bacteria. These methods often comprise diagnosing an animal with an infection or one 
susceptible to infection via analysis of a TLR-4-encoding nucleic acid sequence for a mutation 
relative to a sequence of a gene encoding a native TLR-4 polypeptide, wherein the native TLR-4 
polypeptide is a TLR-4 polypeptide that has the sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:98 or SEQ ID NO:99. In some cases, modulating LPS receptor function 
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comprises providing a TLR-4 polypeptide to the animal. The TLR-4 polypeptide is a native 
TLR-4 polypeptide, for example, one have the sequence of SEQ ID NO:2, SEQ ED NO:4, SEQ 
ID NO:6, SEQ ID NO:98 or SEQ ID NO:99. Alternatively, the TLR-4 polypeptide may be a 
modified TLR-4 polypeptide created by molecular biological methods. In some cases, the 
5 provision of a TLR-4 polypeptide is accomplished by inducing expression of a TLR-4 
polypeptide. For example, the expression of a TLR-4 polypeptide encoded in the animal's 
genome may be induced. Alternatively, the expression of a TLR-4 polypeptide encoded by a 
nucleic acid provided to the animal is induced. In other cases, the provision of a TLR-4 
polypeptide is accomplished by a method comprising introduction of a TLR-4-encoding nucleic 
10 acid to the animal. In some embodiments, the provision of a TLR-4 polypeptide is accomplished 
by injecting a TLR-4 polypeptide into the animal. In yet other embodiments, a TLR-4 
polypeptide is provided as a single chain antibody construct and delivered via adenovirus. The 

O 

irj invention also relates to the inventor's discovery that certain mutants of TLR-4 fail to bind LPS, 
and as a result, the host having such mutations are rendered more susceptible to the bacterial 
ip 15 infection. 

n\ 

- 1 The invention further contemplates methods of modulating an LPS mediated response 

□ comprising modulating TLR-4 function in an animal. Such methods often comprise the step of 
|^ diagnosing the animal via analysis of a TLR-4-encoding nucleic acid sequence for a mutation. 
20 This modulation can be accomplished by providing a TLR-4 polypeptide to the animal in any 

manner discussed above. Alternatively, the modulating TLR-4 function in the animal comprises 

providing a modulator of TLR-4 to the animal. 

The process of modulating an LPS mediated response in the animal may comprise 
25 providing a modulator of TLR-4 to the animal. As used herein, a "modulator of TLR-4" is any 
substance that affects the functioning of TLR-4 in the LPS pathway. For example, the 
modulator of TLR-4 may be an agonist or antagonist of TLR-4. The modulator of may TLR-4 
modulate the transcription and/or translation of a TLR-4-encoding nucleic acid. 
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In some cases, the methods of reducing susceptibility to an infection involve diagnosing 
an animal with susceptibility to infection via analysis of an TLR-4-encoding nucleic acid 
sequence for a mutation, in any of the manners discussed above. 

5 Additional aspects of the invention relate to methods of screening for modulators of an 

LPS mediated response comprising the steps of: a) obtaining a TLR-4 polypeptide; b) 
determining a standard activity profile of the TLR-4 polypeptide; c) contacting the TLR-4 
polypeptide with a putative modulator; and d) assaying for a change in the standard activity 
profile. In these cases, the TLR-4 polypeptide may have the amino acid sequence of SEQ ID 
10 NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:98 or SEQ ID NO:99. The standard activity 
profile of the TLR-4 polypeptide is determined by determining the ability of the TLR-4 
polypeptide to stimulate transcription of a reporter gene, the reporter gene operatively positioned 
under control of a nucleic acid segment comprising a promoter from a TLR-4 gene 



ijl 15 In other embodiments, a method of modulating an LPS mediated response comprising 

m modulating TLR-4 function in an animal is provided, often further comprising the step of 

y = diagnosing the animal for susceptibility to infection via analysis of a TLR-4-encoding nucleic 

□ acid sequence for a mutation relative to a sequence of a gene encoding a native TLR-4 

'.=3. 

polypeptide. In certain embodiments, the animal susceptible to infection is provided a TLR-4 
•tj 20 polypeptide, wherein the TLR-4 polypeptide is a TLR-4 polypeptide that has the sequence of 
i SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:98 or SEQ ID NO:99. In specific 
embodiments, the provision of a TLR-4 polypeptide is accomplished by inducing expression of a 
TLR-4 polypeptide in the animal, introduction of a TLR-4-encoding nucleic acid to the animal 
or by injecting a TLR-4 polypeptide into the animal. In other embodiments, the step of 
25 modulating TLR-4 function in the animal comprises providing a modulator of TLR-4 to the 
animal, wherein the modulator of TLR-4 may be an agonist of TLR-4 or antagonist of TLR-4, 
wherein the agonist of TLR-4 or antagonist of TLR-4 modulates transcription or translation of a 
TLR-4-encoding nucleic acid. 
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In some preferred embodiments, the invention contemplates methods of screening for 
modulators of an LPS mediated response comprising: a) obtaining a TLR-4-encoding nucleic 
acid segment; b) determining a standard transcription and translation activity of the TLR-4- 
encoding nucleic acid sequence; c) contacting the TLR-4-encoding nucleic acid segment with 
5 a putative modulator; d) maintaining the nucleic acid segment and putative modulator under 
conditions that normally allow for TLR-4 transcription and translation; and e) assaying for a 
change in the transcription and translation activity of TLR-4. 

Yet other embodiments relate to modulators of an LPS mediated response prepared by a 
10 process comprising screening for modulators of an LPS mediated response comprising: a) 
obtaining a TLR-4 polypeptide; b) determining a standard activity profile of the TLR-4 
polypeptide; c) contacting the TLR-4 polypeptide with a putative modulator; and d) assaying 
•,j3 for a change in the standard activity profile. Such modulators may be prepared by a process 
;5J comprising screening for modulators of an LPS mediated response comprising: a) obtaining a 
■Ji 15 TLR-4-encoding nucleic acid segment; b) determining a standard transcription and translation 
£0 activity of the TLR-4 nucleic acid sequence; c) contacting the TLR-4-encoding nucleic acid 
" s segment with a putative modulator; d) maintaining the nucleic acid segment and putative 
B modulator under conditions that normally allow for TLR-4 transcription and translation; and e) 
|sg assaying for a change in the transcription and translation activity. 

I 20 

*fl The invention further relates to methods of treating Gram-negative bacterial infections 

comprising administration of an agent that modulates the recognition of endotoxin through an 
LPS mediated response. For example, the agent may simulate or inhibit the activity of a TLR-4 
polypeptide. 

25 

Other objects, features and advantages of the present invention will become apparent 
from the following detailed description. It should be understood, however, that the detailed 
description and the specific examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only, since various changes and modifications within the spirit 
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and scope of the invention will become apparent to those skilled in the art from this detailed 
description. 

Other objects, features and advantages of the present invention will become apparent 
5 from the following detailed description. It should be understood, however, that the detailed 
description and the specific examples, while indicating preferred embodiments of the invention, 
are given by way of illustration only, since various changes and modifications within the spirit 
and scope of the invention will become apparent to those skilled in the art from this detailed 
description. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

-,n The following drawings form part of the present specification and are included to further 

*2 demonstrate certain aspects of the present invention. The invention may be better understood by 

l -sj 15 reference to one or more of these drawings in combination with the detailed description of 

jj! specific embodiments presented herein. 

y FIG. 1. Coarse genetic map of the Lps locus with respect to D4MIT markers. 493 

U meioses from an (SWR X C3H/HeJ) X C3H/HeJ backcross were examined, and crossovers used 

5 '.S3 

ii 5 

20 to map the locus with respect to eleven markers, including the D4MIT markers shown and a 
single marker derived from the interferon-a locus. Numbers above double arrows indicate 
centimorgan distances. The gene was confined to a region between a cluster of four inseparable 
markers (D4MIT244, 218,82, and 325) and the single marker D4MIT80. 

25 FIG. 2A and FIG. 2B. Identification of novel microsatellite markers, and mapping 

of the Lps locus with respect to these markers. FIG. 2A, Map based on 493 meioses analyzed 
in the SWR backcross. FIG. 2B, Map based on 1600 meioses analyzed in the C57BL/6 
backcross. Scale refers to physical distances (note 1 Mb bar), determined on the basis of 
fluorescence in situ hybridization data and pulse-field gel electrophoresis, performed using 
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YACs and BACs that span the critical region (FIG. 4 and FIG. 5). Though only one of the 
markers (7.3) could be used in both analyses, the maps shown in FIG. 2A and FIG. 2B are 
perfectly aligned and drawn to the same scale. Interruptions in map FIG. 2B indicate that 
D4MIT164 and D4MIT27 are quite remote from the region of interest. Numerals above 
5 brackets refer to the number of crossovers within each mapping panel that separate individual 
polymorphic markers from the Lps d mutation. D4MIT markers are shown in shaded circles; 
novel markers defined in Table I are placed in the correct physical order (centromere is on the 
left), and indicated by non-shaded circles. The heavily shaded bar coinciding with the zero 
region of each map refers to an area in which no crossover events were observed, between any of 
10 the markers themselves, or between the markers and Lps d . The location of the Lps d mutation is 
necessarily bounded by markers B and 83.3, as indicated by the heavy bar at the bottom of the 
Figure. 

;2j FIG. 3A and FIG. 3B. Primary biological assay data used in defining crossovers 

ffj 15 between Lps d and B (FIG. 3 A), and between Lps d and 83.3 (FIG. 3B). Assays of LPS 
gj response were performed on seven separate days. On each of these days, control assays of TNF 
production by macrophages (FIG. 3 A) or splenocyte proliferation (FIG. 3B) from obligate 
O heterozygotes and obligate homozygotes for the Lps d , allele were performed in parallel with 
assays performed on samples from the seven mice known to have recombination in the interval 
% 20 between B and 83.3. Results of assays performed on obligate heterozygotes are shown as blue 
*fl circles; obligate homozygote results are shown as red circles. The LPS responses of the seven 
mice with a recombination event between markers B and 83.3 are represented as black squares. 
Four of these mice (FIG. 3A) showed recombination between B and Lps d , and three mice 
(FIG. 3B) showed recombination occurred between Lps d and 83.3. No overlap was apparent 
25 between the range of values obtained with obligate homozygotes and obligate heterozygotes, and 
the mice bearing recombination could be unambiguously categorized as responders or 
nonresponders. Each point represents the mean of four replicate cytotoxicity assays (FIG. 3A) or 
splenocyte proliferation assays (FIG. 3B). Error bars are omitted for simplicity, but were 
generally in the range of 10% of the mean value. 
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FIG 4A and FIG. 4B. FIG. 4A: Centromere is to the left. Sixty-three BACs are 
included in this contig. A gap of small size (probably less than 100 kb) separates BACs 24 and 
25 The remainder of the contig is seamless. The apparent gap between BACs 58 and 59 is 
covered by YAC clone 100E4. Vertical lines indicate the ends of selected BACs contactmg 
SP6 or T7 primer sites within the BAG cloning vector. Tic marks indicate specific umque 
markers listed above the contig. Microsatellite markers are given alphabetical or numencal 
designations. D4M1T markers (325, 25, 178, 7, 132, and 83) are shown in large type. Coded 
microsatellite markers were polymorphic with respect to C3H/HeJ and SWR (magenta) or 
C3H/HeJ and C57BU6 (orange). Double arrows at the top of the Figure indicate phystcal 
distances estimated by interphase FISH analysis (numbers indicate megabases; margm of error= 
+ 10%) Vertical lines separating arrows point to the center of the BACs used for FISH Stance 
measurements (L22, 297012, 259N13 346B6, 217B22, 215K4, and 293L15). All BACs are 
drawn to scale, based on sizes obtained through pulsed-field gel electrophores.s. 

FIG 4B: Bars indicate genetic distances assigned by means of two independent 
back-crosses 1 A backcross involving SWR mice yielded 493 meaningful meioses. Three 
crossovers were observed between marker 83.3 and marker 7.11, corresponding to a geneUc 
distance of 0.6 cM, and one crossover was observed between marker D4MTT325 and marker 
25 5 corresponding to a genetic distance of 0.2 cM (magenta bars). A backcross involvmg 
C57BI76 mice yielded 1600 meaningful meioses. Four crossovers were observed between 
marker B and marker A, corresponding to a genetic Stance of 0.25 cM and 13 crossovers were 
observed between marker C and marker B, corresponding to a genetic distance to 0.8 cM 
(orange bar) On 2093 meioses, the mutation is therefore confined to an interval between 
markers B and 83.3. This corresponds to a physical distance of 2.6 Mb (denoted by the blue 
bar) Complete absence of crossovers (the zero region) was observed in the compose of the 
tW ocrossesoveral.2 Mb interval extending from marker A though marker 7.11 (denoted by the 
black bar). No polymorphic markers capable of distinguishing C57BU6 from C3H/HeJ were 
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identified distal to marker 7.1 1 in the critical region. The BACs represented in this Figure are 
as follows: 

(1. 4L22 2. 329E1 3. 331E22 4. 18J9 5. 259B3 6. 147P7 7. 179M4 8. 297012 9. 
5 363L11 10. 353J12 11. 15108 12. 312J8 13. 358P4 14. 327021 15. 297N10 16. 92G10 17. 
259N 1 3 18. 243020 19.21 6C 14 20. 131M6 21.49K20 22. 135017 23. 274K20 24.336A11 
25. 309117 26. 152C16 27. 352P10 28. 58H7 29. 84C8 30. 346B6 31. 373118 32. 288K23 33. 
291G16 34.27608 35.340116 36.269E13 37.62A9 38. 389F 15 39. 353021 40. 197M3 41. 
293J8 42. 220E13 43. 181N19 44. 369F7 45. 430N20 46. 370J14 47. 213015 48. 265H22 49. 
10 20B5 50. 175118 51. 247P7 52. 264N15 53. 204O1 54. 217B22 55. 178D24 56. 300H9 57. 
188A22 58. 289J1 1 59. 152B3 60. 288020 61. 216K4 62. 293L15 63. 147M3) 

;ft All BAC designations refer to the Research Genetics mouse BAC library, with the 

'S exception of BACs 84C8 and 389F15, which were obtained from Genome Systems (mouse 
5 15 C57BL/6 BAC library). 

1 FIG. 5. Minimal contig of BACs and a single YAC, sequenced in the search for Lps. 

y BACs sequenced to completion or to the point of finishing are shown in blue ("complete"). 
M BACs sequenced to a high density, but not yet at the point of finishing, are shown in red ("in 
% 20 progress"). YAC clone 100E4 has also been partially sequenced. BACs 293L15 and 147M3 
^0 have not been sequenced. Bar at the top of the figure refers to genetic distances from limiting 
markers B and 83.3 to Lps d . The zero recombination area is shown in black. Several of the 
polymorphic markers used in mapping (circled) are included as landmarks. The positions of the 
pseudogenes detected are shown at BAC-level resolution; sequences from TLR-4, considered 
25 the prime candidate gene, were detected in BAC 309117 and in BAC 152C16. 

FIG. 6. TLR-4 receptor locus and a portion of the Pappa locus. The two genes were 
identified by GRAIL and BLAST analysis. The orientation and location of TLR-4 and Pappa 
are shown with respect to the nearest genetic markers. 
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- FIG. 7 A: Amino acid sequences of mouse mutant J-Toll-4, mouse N-Toll-4, rat 
TLR-4 and human TLR-4. The mutant mouse J-toll TLR-4 amino acid sequence contains a 
point mutation at rasidue\712 (proline to histidine), not found in the amino acid sequences of N- 
Toll-4, rat TLR-4 joii human TLR-4. The numbering system in this figure does not take into 
account the spacing to maximize the sequence alignment. 

£ FIG. Ilk Sequencing of amplified DNA from the C3H/HeJ, C3H/HeN, SWR, 



10 



C57BL/6, and DBA-2 mice genomes. 



FIG. 8A. Amplification of low abundance control cDNAs from HeN, HeJ, ScSn, 
ScCr, HeN, HeJ, ScSn and ScCr. 

FIG. 8B. TLR-4 mRNA detected by Northern blots using total RNA from 
P 15 macrophages. 



FIG. 9. TLR-4 mRNA induced by LPS in lymphoid tissues. 

FIG. 10. Spatially conserved representation of coding mutations found in TLR4 of 
102 human subjects. Each lightly shaded dot refers to a mutation affecting a residue that is 

20 relatively conserved (either invariant among six mammalian species, or extant in two forms). 
Each black dot refers to a less conserved residue (three or more variants among species). 
Twelve individuals were found to be heterozygous for a double amino acid substitution. One 
individual was found to be heterozygous at only one of the mutant sites. Introns are shortened, 
and non-coding regions are not shown, but the coding region of the three principal human exons 

25 is drawn to an equivalent scale at all points in the illustration. 
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FIG. 11. Spatially conserved representation of coding mutations found in Tlr4 of 35 

Mus musculus strains. As in FIG. 10, each lightly shaded dot. refers to a mutation that is 
relatively conserved, and each black dot refers to a less conserved mutation. 

5 FIG. 12. Genetic distance and probable ancestral relationships among Tlr4 genes of 

35 Mus musculus strains. Numbers within circles denote strains, in accordance with the legend 
of Table IV. Numbers within parentheses indicate the mutational distance (number of mutations 
separating each strain from its presumed ancestor), with reference to both coding and non- 
coding substitutions listed in Table IV. Arrows point in the direction of descent, and in length 
10 are proportionate to distance. Dashed arrows suggest that mice of a given genotype evidently 
contributed genetic information to mice of another strain, given the similarity of the mutations 
observed, though unique mutations are also observed in the latter latter strain, and not all of the 
mutations observed in the former strain are present. The symbol "?" denotes the likelihood of 
an intermediate form prior to interbreeding of strains. 

15 

J FIG. 13. Spline curve illustrating interspecific sequence variation across the Tlr4 

jif? protein. A multiple alignment of Tlr4 sequences from three rodent species (mouse, rat, and 
'% hamster) and three primate species (human, chimpanzee, and baboon) was generated using the 
GCG program pileup. The number of amino acids observed at each residue was plotted using 
20 the program Prism 3.0 (a value of 1 was assigned if a single amino acid was observed among the 
six species; a value of 5 was assigned if five forms were observed among the six species; etc). 
The points were then connected using a cubic spline curve. Ecto, the extracellular domain; TM, 
the transmembrane domain; Prox, the proximal cytoplasmic domain; Dist, the distal cytoplasmic 
domain. Numbering refers to the human sequence. Where a deletion was introduced by pileup, 
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a single mismatch was assumed. Where the sequence was truncated, each absent residue was 
tabulated as a separate mismatch. 

FIG. 14. Schematic illustration of recombinant proteins expressed in RAW 264.7 

Constructs were made by PCR, using cDNA derived from C3H/HeJ and C3H/HeN mice. 
The primers (5'_3'): \ 

ATC GAT ACQAGG AGG CTT GAA TCC C 
and \ 

TAT CGA TAC CAG GAA GCT TGA ATC CC 

were used to generates the full-length amplified products, which were cloned into the 
vector pFLAG-CMV-1 (Sigma^ using Clal and Kpnl sites. The native signal peptide was thus 
removed, and an alternative signal peptide, followed by the flag sequence, was provided by the 
vector. The ectodomain construct was produced using the downstream primer (5'_3 ( ): 

CAG GGT ACC TCA CAG QTG AAA ATA GAA GTG GTA T, 

whereas the two cytoplasmic Momain constructs were produced using the upstream 
primer (5'_3'): \ 

GCC GAA TTC AAT GTA CAA QAC AAT CAT CAG T. 

The latter two constructs were cloned into pFLAG-CMV-1 using EcoRI and Kpnl sites. All 
constructs were verified by DNA sequencing on both strands. All expression constructs were 
shown to yield products of anticipated size in COS cells, after Western blot detection with M2 
monoclonal antibody (not shown). \ 

FIG. 15A. Saturation isotherm of monoclonal antibody M2 binding to a single 
clone of RAW 264.7 cells transfected with an expression vector encoding Tlr4 Lps d . M2 

antibody was labeled to a specific activity of 3.0 x 10 6 cpm/» g using 125 I, by means of the 
iodogen technique 35 . Labeled antibody (concentration range 50 ng/ml to 20 • g/ml) was added 
in a volume of 1.0 ml to monolayers of 2.0 x 10 6 cells in Hank's balanced salt solution, 
supplemented with 10% fetal bovine serum and buffered with 50 mM HEPES, pH 7.4. The cells 
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were maintained at 0_ C for a period of 4 hours. Performed as such, 56.4% saturation of the 
surface receptor was achieved at equilibrium using the highest concentration of antibody. 
Nonlinear regression analysis, based on the assumption of hyperbolic binding kinetics, suggested 
a Kd of 1.0 x 10" 7 A 3.0 x 10* 8 M' 1 and the presence of 2.81 x 10 4 A 4.99 x 10 3 binding sites per 
5 cell. R 2 for the analysis was 0.9576. Controls, performed for all plates, included the addition of 
the flag peptide to the system at a 10 • M concentration in order to block specific binding, and 
only specific binding (in general, approximately 80% of total binding) is presented here. 
Mathematical analysis was performed using using the program Prism 3.0 (GraphPad Software 
Inc). 

10 

FIG. 15B. The expression level of each construct (copy number per cell) was 
determined by direct measurement of equilibrium M2 monoclonal antibody binding at a fixed 
% g concentration, with reference to the nonlinear regression analysis of saturation isotherms 
% presented in FIG 15 A. Each point represents the result of duplicate determinations of specific 
15 binding. Cells transfected with vector alone had zero specific binding (not shown). 

*2 

Iff 

FIG. 15C. Composite analysis of the shift in EC50 caused by expression of 
Q T i r4 Lps-n and T i r4 Lps- d in RAW 1M1 cells Each point represents m EC50 determination 

jss performed on ten independent clones at the stated LPS concentration. For each clone, four 
20 replicate assays were performed at each concentration. Hence, each curve represents the 

^ composite analysis of 240 assays. Transfected macrophages were plated in 24 well plates at a 
density of 5 x 10 5 cells per well, and covered with 1.0 ml of DMEM supplemented with 10% 
FBS. LPS was added to each well at the concentration indicated. After 15 hours of incubation 
the medium was harvested, and TNF concentration was assayed in the standard L-929 
25 cytotoxicity system , using cycloheximide at a concentration of 100 • g/ml to potentiate killing. 
After 15 hours of exposure to diluted macrophage medium or to mouse TNF applied at a range 
of 8 standard concentrations, the number of viable cells was determined by staining with crystal 
violet. A standard curve relating % cytotoxicity to TNF concentration (not shown) was 
generated using Prism 3.0, and was based on an assumption of sigmoidicity. Variance among 
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replicate samples was typically beneath 5% of the mean, and departures of individual samples 
from the sigmoid plot were typically smaller still. Linear estimates of TNF concentration in 
unknown samples were based on non-linear interpolation from the standard curve (also 
performed using Prism 3.0). The curves shown were generated assuming sigmoidicity of 
response with variable slope for each curve, and further assigning a maximum response value of 
1760 pg TNF, which yielded an optimal fit for the most responsive curve. Error bars indicate 
standard deviation among clones at each LPS concentration. Log EC50 values and standard 
error values are shown in the inset table. 

FIG. 15D. Shifts of the log EC50, determined for individual clones. A more 
conservative approach, in which log EC50 values were determined for individual clones 
transfected with vector alone, or with the Tlr^ 5 0 or Tlr4 Lps d constructs. The log EC50 data 
were then represented as a scatter plot, in which each point represents the log EC50 
determination of a single clone (confidence limits not shown for the sake of simplicity), and is 
derived from 24 separate TNF assays performed on samples stimulated over a 10 4 -fold range of 
LPS concentration (100 pg/ml to 1 • g/ml, as well as an unstimulated control). In most 
instances, two determinations of log EC50 were made independently for each clone. The mean 
and standard error of each cluster is shown in the inset, together with p values defining the 
likelihood that the EC50 displacements are attributable to chance. Data were analyzed by means 
of a one-tailed t test, using Welch's correction for unequal variance. 

FIG. 15E. Lack of correlation between level of recombinant receptor expression 
and the magnitude of effect on EC50. Plotting the EC50 of Tlrf 1 * 5 "" transfected cells (nine 
clones; duplicate assays) and Tlrt 1 * 5 "* 1 transfected cells (eight clones; duplicate assays) vs. the 
receptor number measured for each clone (shown separately in Figures 2b and 2d), it is apparent 
that no correlation exists over the range of receptor number surveyed. Inset: p values calculated 
to assess the significance of departure of the slope (determined by linear regression for each set 
of points) from zero. 
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FIG. 16A. Over-expression of the Tlr4 ectodomain fails to inhibit LPS signaling. 

Composite analysis of the shift in EC50 caused by expression of the Tlr4 ectodomain (no 
cytoplasmic domain) or the full-length Tlr4 Lps d protein in RAW 264.7 cells. Control cells were 
5 transfected with the empty vector. Analysis was carried out in a manner identical to that 
described in Figure 2c. Inset: log EC50 for each curve, and standard error. 

FIG. 16B. Distribution of individual log EC50 data for clones expressing the Tlr4 
ecotodomain, the full-length Tlr4 Lps " d protein, or no recombinant receptor (transfected 
10 with empty vector). Inset: the mean and standard error of each cluster, together with p values 
defining the likelihood that the EC50 displacements are attributable to chance. Data were 
ps analyzed by means of a t test, using Welch's correction for unequal variance. 

'D FIG. 17A. Over-expression of the Tlr4 cytoplasmic domain, either with or without 

ifi 15 the Lps-d mutation, impairs signal transduction in RAW 264.7 cells. Composite analysis of 

p the shift in EC50 caused by expression of the Tlr4 ectodomain (no cytoplasmic domain) or the 
full-length Tlr4 Lps " d protein in RAW 264.7 cells. Control cells were transfected with the empty 

!=j 

iO vector. Analysis was carried out in a manner identical to that described in Figure 2c. Inset: log 

u 

j« EC50 for each curve, and standard error. 

5 

*8 20 

FIG. 17B. Distribution of individual log EC50 data for clones expressing the 
cytoplasmic domain of the Tlr4 Lps d protein, the Tlr4 Lps " n protein, or no recombinant 
protein (transfected with empty vector). Inset: the mean and standard error of each cluster, 
together with p values defining the likelihood that the EC50 displacements are attributable to 
25 chance. Data were analyzed by means of a t test, using Welch's correction for unequal variance. 
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DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 



Bacterial infections represent a significant challenge in the treatments of a wide variety 
of disease. Structurally disparate LPS molecules, produced by many different species of Gram- 
5 negative bacteria, are engaged on the macrophage surface by CD 14 and ultimately trigger the 
release of cytokines such as tumor necrosis factor (TNF) and interleukin-1 (IL-1). These 
cytokines orchestrate the inflammatory response, with its attendant beneficial and harmful 
effects. 

10 Given the ubiquitous presence of Gram-negative bacteria and the high incidence of 

Gram-negative mediated infections, certain individuals are at high risk to develop endotoxic 
shock even if adequate antimicrobial therapy is instituted. Methods for diagnosing patients at 
=n high risk for Gram-negative bacterial infections and sepsis in advance of its onset would be 
|S beneficial. Further, there is a need to determine whether a particular individual may be 
EH 15 susceptible to Gram-negative bacterial infection, in determining the course of treatment for any 
gj number of diseases. For example, it will be beneficial to test individuals who are candidates for 
] = l immuno-suppressive drug therapy, for susceptibility to Gram-negative bacteria in order to assess 
y the risks of immunosuppressive treatment. Such diagnostic methods to predict the risk of 
!=s infection as well as the clinical course of sepsis could be reasonably applied to most hospitalized 
jii 20 patients. The present invention is directed towards addressing these needs. 



The claimed invention also provides methods of identifying agents which stimulate 
endotoxin signaling such that TNF and other cytokines are released from macrophages. It is 
envisioned that such agents will be of therapeutic use in the treatment of individuals who are 
25 insensitive to endotoxin, either through genetic defect, disease or other condition, and whose 
immune system requires external stimulation to recognize infection from Gram-negative 
bacteria. 
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A. The Present Invention 

In broad aspects, the present invention provides methods for screening for susceptibility 
to infection. On the basis of 2093 meioses analyzed in two separate intraspecific backcrosses, 
the location of the mouse Lps d mutation has been circumscribed to a genetic interval 0.9 cM in 
5 size. To identify gene candidates, nearly 40,000 sequencing runs were performed across the 
critical region. Selective hybridization and exon trapping were also employed to identify genes 
throughout the "zero" region. These studies revealed that only a single intact gene was 
identified within the entire critical region. This gene encodes the TLR-4 receptor, a member of 
the IL-1 family of receptors. Thus, the present inventors demonstrate that there is a mutation in 
10 the TLR-4-encoding gene that appears to provide a predisposition to infection. 

In these studies a total of 19 genetic markers that lie in close proximity to the mutation 
were examined in mapping. Most of these were previously unpublished polymorphic 
microsatellites, identified by fragmentation of YAC and BAC clones spanning the region of 

15 interest. Lps d was found to be inseparable from the microsatellite marker D4MIT178, and from 
three novel polymorphic microsatellites identified near D4MIT178. The mutation was confined 
between two novel microsatellite markers, herein designated "B" and "83.3." B lies centromeric 
to the mutation, and was separated by four crossovers in a panel of 1600 mice; 83.3 lies distal to 
the mutation and was separated by three crossovers in a panel of 493 mice. 66 BAC clones and 

20 one YAC clone were assembled to cover >95% of the critical region. Estimates based on 
pulsed field gel electrophoresis and fluorescence in situ hybridization indicate that the B-» 83.3 
interval is about 3.2 Mb in length. A minimal area of zero recombinational distance from Lps d 
was also assigned, and found to occupy approximately 1.2 Mb of physical size. 

25 It was found that in the macrophages of mice that are susceptible to bacterial infection 

there is a genetic mutation in the Ips locus. Specifically, there is a mutation in the TLR-4 
receptor that is expressed by the macrophages of these compromised mice and this mutation 
leads to a reduced recognition of endotoxin. As the endotoxin is not recognized by these 
defense cells, there is a lack of immune response mounted against the invading bacteria which 
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results in the deleterious effect of the infection. The present invention suggest that similar 
mechanisms work in other mammalian cells and as such in a broad sense the present invention 
provides methods of preventing a bacterial infection of a host comprising ensuring that the 
macrophages of the host express a function TLR-4 or Toll-like receptor. By providing such a 
5 functional receptor, the present invention ensures that the endotoxin signal is recognized by the 
immune system of the host. Conversely, in those instances in which widespread activation of 
macrophages by endotoxin results in the overproduction of TNF leading to the development of 
septic shock, it may be desirable to down-regulate the TLR-4 receptor. 

10 Thus, the present invention also provides a method of treating gram-negative bacterial 

infections comprising administration of an agent that stimulates the recognition of endotoxin. In 
specific circumstances such an agent is likely to by a candidate substance that stimulates the 
:f) expression, activity or function of the TLR-4 receptor that is expressed by the macrophages of 
;2 the host. Similarly, methods and compositions for treating endotoxin related symptoms 
^ 15 comprising administration of an agent that reduces the recognition of endotoxin also are 
yj provided by the present invention. A useful compound that may be identified by the present 
invention is one which inhibits endotoxin signaling by binding to a TLR-4 receptor and 
y competing with endotoxin for the binding. Particularly preferred agents would be those that 
|^ modulate the stimulation of TNF and cytokine secretion. Such a modulation may be an increase 
JJ 20 in secretion in circumstances where the endotoxin has not been recognized or a decrease in 
^ secretion in circumstances where there has been a deleterious production of TNF and/or other 
cytokines. 

Thus, as outlined above and described in detail herein below, the TLR-4 sequence will 
25 find utility in a variety of applications in bacterial infection susceptibility detection, diagnosis, 
prognosis and treatment. Examples of such applications within the scope of the present invention 
include amplification of markers of LPS mediated infections using specific primers; detection of 
markers of TLR-4 by hybridization with oligonucleotide probes; incorporation of isolated nucleic 
acids into vectors and expression of vector-incorporated nucleic acids as RNA and protein; 
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development of immunologic reagents corresponding to gene encoded products; and therapeutic 
treatment for the identified infection using these reagents as well as, anti-sense nucleic acids, or 
other inhibitors specific for the identified disease. The present invention further discloses 
screening assays for compounds to upregulate gene expression or to combat the effects of the 
5 mutant TLR-4 genes. 

B. LPS Mutation is Responsible for Susceptibility to Bacterial Infection 

The Lps is an important susceptibility locus, influencing the propensity to develop a 
disseminated Gram negative infection, or the outcome of such an infection. Hence, C3H/HeJ 
10 mice, while highly resistant to LPS, show exaggerated susceptibility to infection by Gram 
negative organisms (O'Brien et ai, 1980; Macela et ai, 1996). In birds, resistance to Salmonella 
typhimurium is linked to a polymorphism at the tenascin locus (Hu et ai, 1997); tenascin is 
closely linked to the Lps gene, and it may be assumed that a mutation of the avian Lps locus lies 
in linkage disequilibrium with the tenascin marker, yielding the reported association. From 
15 these investigations it is likely that mutations at the equivalent locus in humans also will 
influence the course of Gram negative infection. 

The Lps critical region is remarkably gene-poor. While the average megabase of 
mammalian DNA contains approximately 30 genes, only one authentic gene (and a portion of a 

ITi 

^ 20 second gene) have been detected within 2.6 Mb of DNA flanking Lps. As virtually all of the Lps 
% « critical region was sequenced herein, and no other plausible candidates were found, it must be 
considered that the lone candidate, encoding the toll-like receptor 4 (TLR-4; Tlr-4; TIL4) of 
mice, very likely represents the Lps locus. Moreover, the inventors suggest that a specific 
mutation of this gene is responsible for the endotoxin-unresponsive phenotype witnessed in 
25 C3H/HeJ mice. 

The toll family of receptors (Chaudhary et ai, 1998; Rock et ai, 1998) is defined by 
homology to the Drosophila toll protein, a plasma membrane receptor which engages an 
extracellular mediator encoded by spdzle, leading to activation of a rel gene family member, by 
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inducing its dissociation from cactus. This sequence of events is important for induction of the 
drosomycin antifungal response in Drosophila (Rosetto et ai, 1995; Lemaitre et ai, 1996). The 
mammalian IL-1 receptor is a member of the toll family of proteins, and four other mammalian 
family members (Toll-like receptors 1 through 4) have been identified by molecular cloning, 
though their function is uncertain. IL-1 signaling involves, among other events, the activation of 
NF-kB, which like dorsal, is a member of the rel family. LPS signaling also entails activation 
of NF-kB. As such, it is plausible to consider that LPS signaling might involve transduction via 
a toll family member. The present invention, for the first time shows that TLR-4 is the receptor 
for LPS. 



Further evidence consistent with this hypothesis may be seen in the clinical observation 
of Kuhns and coworkers, who determined that a profound immunodeficiency results from a 
conjoint defect in responses to IL-1 and LPS (Kuhns et ai, 1997). In view of the inventor's 
findings, the fact that a single mutation may block signal transduction initiated by both IL-1 and 
15 LPS may be taken to indicate that a common (proximal) mediator serves both the IL-1 receptor 
and the LPS receptor. This, in turn, would suggest the existence of structural similarity between 
the BL-1 and LPS receptors. While the details of signal transduction via toll family members 
have not been fully elucidated, the involvement of MyD88, IRAK, and TRAF6 has recently been 
proposed in the case of the TLR-4 receptor (Muzio et ai, 1998). 



The Lps d mutation has a codominant character, and attempts to identify the product of 



Lps through expression cDNA cloning in C3H/HeJ macrophages were unsuccessful. Blockade 
of endotoxin signal transduction in the C3H/HeJ mice may therefore reflect the expression of a 
protein with dominant negative characteristics. As CD14 serves as the physical receptor for LPS 
25 on the cell surface yet lacks a transmembrane domain, it would seem likely that CD 14 engages 
TLR-4, and that the latter protein acts to transduce the LPS signal across the membrane. It is 
possible that the Lps d mutation leads to unproductive interaction between CD 14 and an TLR-4, 
preventing signal transduction through other components of the signaling pathway. 
Alternatively, the mutation may merely abolish signal transduction through TLR-4 itself. 
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Examination of these hypotheses will depend upon the demonstration of a mutational difference 
between the TLR-4 gene in C3H/HeJ mice and in C3H/HeN animals. 

C. TLR-4 Polypeptides 

TLR-4 may be obtained according to various standard methodologies that are known to 
those of skill in the art. For example, antibodies specific for TLR-4 may be used in 
immunoaffinity protocols to isolate TLR-4 from cells. Antibodies are advantageously bound to 
supports, such as columns or beads, and the immobilized antibodies can be used to pull the 
TLR-4 target out of the cell lysate. Size fractionation (chromatography, centrifugation), ion 
exchange or affinity chromatograph, and even gel purification may be used for purification as 
well. 

TLR-4, according to the present invention, may advantageously be cleaved into fragments 
for use in further structural or functional analysis, or in the generation of reagents such as TLR-4- 
related polypeptides and TLR-4-specific antibodies. This can be accomplished by treating purified 
or unpurified TLR-4 with a peptidase such as endoproteinase glu-C (Boehringer, Indianapolis, IN). 
Treatment with CNBr is another method by which TLR-4 fragments may be produced from natural 
TLR-4. Recombinant techniques also can be used to produce specific fragments of TLR-4. 

In addition to the peptidyl compounds described herein, the inventors also contemplate that 
other sterically similar compounds may be formulated to mimic the key portions of the peptide 
structure, called peptidomimetics. Mimetics are peptide-containing molecules which mimic 
elements of protein secondary structure. See, for example, Johnson et al (1993). The underlying 
rationale behind the use of peptide mimetics is that the peptide backbone of proteins exists chiefly 
to orient amino acid side chains in such a way as to facilitate molecular interactions, such as those 
of receptor and ligand. 

Successful applications of the peptide mimetic concept have thus far focused on mimetics 
of P-turns within proteins. Likely P-turn structures within TLR-4 can be predicted by computer- 
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based algorithms as discussed above. Once the component amino acids of the turn are determined, 
mimetics can be constructed to achieve a similar spatial orientation of the essential elements of the 
amino acid side chains, as discussed in Johnson et al (1993). 

5 D. DNA Segments 

Important aspects of the present invention concern isolated DNA segments and 
recombinant vectors encoding TLR-4, and the creation and use of recombinant host cells 
through the application of DNA technology, that express TLR-4 for the purposes of increasing 
the recognition of endotoxin by a host cell. The present invention shows that the long sought 
10 after gene at the Ips locus is TLR-4. The TLR-4 a receptor for LPS and is thus the protein that 
binds to and recognizes LPS from the bacteria cell wall. It is recognition of the endotoxin by the 
TLR-4 that allows an animal to mount an immune response against the invading bacteria. If the 
y TLR-4 peptide is mutated such that it is unable to recognize the LPS the animal will not be able 
y to mount an immune response. 

1 15 

;J TLR-4 is a member of the IL-1 family of receptors. U. S. Patent 5,786,331; U. S. Patent 

m 5,776,731; U. S. Patent 5,767,234; U. S. Patent 5,767,064; U. S. Patent 5,726,148; U. S. Patent 

j= 5,608,035; U. S. Patent 5,508,262; U. S. Patent 5,488,032; U. S. Patent 5,464,937 each 

j*f specifically incorporated herein by reference, describe the IL-1 receptor and methods and 

|fi 20 compositions related to modulating the activity thereof. The present invention contemplates 

% using techniques and compositions similar to those described in these patents for use with the 

'XT 

TLR-4 receptor of the present invention. 

DNA segments, recombinant vectors, recombinant host cells and expression methods 
25 using sequences of the human TLR-4 (SEQ ID NO:l, SEQ ID NO:3 and SEQ ID NO:47), rat 
(SEQ ID NO:5) and mouse (SEQ ID NO:45, SEQ ID NO:46 and SEQ ID NO:48) also are 
provided. These sequences express human polypeptides of SEQ ID NO:2, SEQ ID NO:4 and 
SEQ ID NO:98, rat polypeptide of SEQ ID NO:6, and mouse polypeptide of SEQ ID NO:99, 
respectively. Each of the foregoing genes are included within all aspects of the following 
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description. The present invention concerns DNA segments, isolatable from mammalian and 
human cells, that are free from total genomic DNA and that are capable of expressing a 
functional TLR-4 protein. As used herein, the term "DNA segment" refers to a DNA molecule 
that has been isolated free of other genomic DNA of a particular species. Therefore, a DNA 
segment encoding a TLR-4 protein refers to a DNA segment that contains TLR-4 protein coding 
sequences yet is isolated away from, or purified free from, total mammalian or human genomic 
DNA. Included within the term "DNA segment", are DNA segments and smaller fragments of 
such segments, and also recombinant vectors, including, for example, plasmids, cosmids, phage, 
viruses, and the like. 

Similarly, a DNA segment comprising an isolated or purified TLR-4 protein or subunit 
gene refers to a DNA segment including purified TLR-4 protein or subunit protein coding 
sequences and, in certain aspects, regulatory sequences, isolated substantially away from other 
naturally occurring genes or protein encoding sequences. In this respect, the term "gene" is used 
for simplicity to refer to a functional protein, polypeptide or peptide encoding unit. As will be 
understood by those in the art, this functional term includes both genomic sequences, 
complementary DNA (cDNA) sequences and smaller engineered gene segments that express, or 
may be adapted to express, TLR-4 proteins, polypeptides, domains, peptides, fusion proteins and 
mutants. 

"Isolated substantially away from other coding sequences" means that the gene of 
interest, in this case a TLR-4 protein gene, forms the significant part of the coding region of the 
DNA segment, and that the DNA segment does not contain large portions of naturally-occurring 
coding DNA, such as large chromosomal fragments or other functional genes or cDNA coding 
regions. Of course, this refers to the DNA segment as originally isolated, and does not exclude 
genes or coding regions later added to the segment by the hand of man. 

In particular embodiments, the invention concerns isolated DNA segments and 
recombinant vectors incorporating DNA sequences that encode a TLR-4 protein or subunit that 
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includes within its amino acid sequence a contiguous amino acid sequence in accordance with, 
or essentially as set forth in, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:98 or 
SEQ ID NO:99 corresponding to TLR-4 in humans, rat and mouse, respectively. Moreover, in 
other particular embodiments, the invention concerns isolated DNA segments and recombinant 
5 vectors that encode a TLR-4 protein or subunit that includes within its amino acid sequence the 
substantially full length protein sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ 
IDNO:98orSEQIDNO:99. 

The term "a sequence essentially as set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:98 or SEQ ID NO:99 " means that the sequence substantially corresponds to 
10 a portion of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:98 or SEQ ID NO:99 and 
has relatively few amino acids that are not identical to, or a biologically functional equivalent of, 
the amino acids of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:98 or SEQ ID 
ifl NO:99. The term "biologically functional equivalent" is well understood in the art and is further 
j)S defined in detail herein. Accordingly, sequences that have between about 85% and about 90%; 

15 or more preferably, between about 91% and about 95%; or even more preferably, between about 
B 96% and about 99%; of amino acids that are identical or functionally equivalent to the amino 
f acids of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:98 or SEQ ID NO:99 will be 
5 sequences that are "essentially as set forth in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ 
M ID NO:98 or SEQ ID NO:99 provided the biological activity of the protein is maintained. 




In certain other embodiments, the invention concerns isolated DNA segments and 
recombinant vectors that include within their sequence a nucleic acid sequence essentially as set 
forth in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID 
NO:47 and SEQ ID NO:48. The term "essentially as set forth in SEQ ID NO:l, SEQ ID NO:3, 
25 SEQ ID NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 and SEQ ID NO:48" is used in 
the same sense as described above and means that the nucleic acid sequence substantially 
corresponds to a portion of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ 
ID NO:46, SEQ ID NO:47 or SEQ ID NO:48 and has relatively few codons that are not 
identical, or functionally equivalent, to the codons of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
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N0:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48. Again, DNA 
segments that encode TLR-4 or related proteins or subunits will be most preferred. 

The term "functionally equivalent codon" is used herein to refer to codons that encode 
the same amino acid, such as the six codons for arginine or serine, and also refers to codons that 
encode biologically equivalent amino acids (see Codon Table, below). 
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CODON TABLE 



Amino Acids 


Codons 


Alanine 

i lluillllv 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cvsteine 


Cys 


c 


UGC 


UGU 










A^nartir arid 


Asp 


D 


GAC 


GAU 










vjiuicuiiiV' uv<iu 


Glu 




GAA 


GAG 










Phpnvlataninp 


Phe 

£ lit/ 


F 




T TT TIT 












niv 


ri 


GGA 


GGC 


VJVJVJ 


GG\] 






TTictiHinp 


nio 


j.1 


CAC 


CAU 










Tco1pnr*inp 

15U1CUI/111G 


Tie 

Alt 


T 


AUA 


AUC 










T vsine 


Lys 


K 


AAA 


AAG 










T jPiirinp 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


cuu 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGCAGU 


UCA 


ucc 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











M It will also be understood that amino acid and nucleic acid sequences may include 

U : 

additional residues, such as additional N- or C-terminal amino acids or 5' or 3* sequences, and 
yet still be essentially as set forth in one of the sequences disclosed herein, so long as the 
sequence meets the criteria set forth above, including the maintenance of biological protein 
10 activity where protein expression is concerned. The addition of terminal sequences particularly 
applies to nucleic acid sequences that may, for example, include various non-coding sequences 
flanking either of the 5' or 3' portions of the coding region or may include various internal 
sequences, i.e., introns, which are known to occur within genes. 



15 Excepting intronic or flanking regions, and allowing for the degeneracy of the genetic 

code, sequences that have between about 75% and about 79%; or more preferably, between 
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about 80% and about 89%; or even more preferably, between about 90% and about 99%; of 
nucleotides that are identical to the nucleotides of SEQ ED NO:l, SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:45 or SEQ ID NO:46 will be sequences that are "essentially as set forth in SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ 
5 ID NO:48." 



Sequences that are essentially the same as those set forth in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48 may 
also be functionally defined as sequences that are capable of hybridizing to a nucleic acid 
10 segment containing the complement of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48 under relatively stringent conditions. 
Suitable relatively stringent hybridization conditions will be well known to those of skill in the 
art. 



15 Suitable standard hybridization conditions for the present invention include, for example, 

hybridization in 50% formamide, 5x Denhardts' solution, 5x SSC, 25 mM sodium phosphate, 
0.1% SDS and 100 |ig/ml of denatured salmon sperm DNA at 42°C for 16 hours followed by 1 
hour sequential washes with O.lx SSC, 0.1% SDS solution at 60°C to remove the desired 
amount of background signal. Lower stringency hybridization conditions for the present 
g 20 invention include, for example, hybridization in 35% formamide, 5x Denhardts' solution, 5x 
SSC, 25 mM sodium phosphate, 0.1% SDS and 100 |ig/ml denatured salmon sperm DNA or E. 
coli DNA at 42°C for 16 hours followed by sequential washes with 0.8x SSC, 0.1% SDS at 
55°C. Those of skill in the art will recognize that conditions can be readily adjusted to obtain 
the desired level of stringency. 



25 



Naturally, the present invention also encompasses DNA segments that are 
complementary, or essentially complementary, to the sequence, set forth in SEQ ID NO:l, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48. 
Nucleic acid sequences that are "complementary" are those that are capable of base-pairing 
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according to the standard Watson-Crick complementarity rules. As used herein, the term 
"complementary sequences" means nucleic acid sequences that are substantially complementary, 
as may be assessed by the same nucleotide comparison set forth above, or as defined as being 
capable of hybridizing to the nucleic acid segment of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
5 NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48, under relatively 
stringent conditions such as those described immediately above. 

The nucleic acid segments of the present invention, regardless of the length of the coding 
sequence itself, may be combined with other DNA sequences, such as promoters, 
10 polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding 
segments, and the like, such that their overall length may vary considerably. It is therefore 
contemplated that a nucleic acid fragment of almost any length may be employed, with the total 
:|3 length preferably being limited by the ease of preparation and use in the intended recombinant 

y 

DNA protocol. 

'Is? 

^ 15 

m For example, nucleic acid fragments may be prepared that include a short contiguous 

stretch identical to or complementary to SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID 

5 NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48, such as about 22-27 or about 32-36 

M nucleotides, and that are up to about 30,000 or 20,000, or about 10,000, or about 5,000 base 
20 pairs in length, with segments of about 3,000 being preferred in certain cases. DNA segments 

,y with total lengths of about 1,000, about 500, about 200, about 100 and about 50 base pairs in 
length (including all intermediate lengths) are also contemplated to be useful. 

It will be readily understood that "intermediate lengths," in these contexts, means any 
25 length between the quoted ranges, such as 22, 23, 24, 25, 26, 27, 28, 29, etc; 30, 31, 32, 33, 34, 
etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, etc.; 150, 151, 152, 153, etc.; including all integers 
through the 200-500; 500-1,000; 1,000-2,000; 2,000-3,000; 3,000-5,000; 5,000-10,000 ranges, 
up to and including sequences of about 12,001, 12,002, 13,001, 13,002, 15,000, 20,000, 30,000 
and the like. 
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The various probes and primers designed around the disclosed nucleotide sequences of the 
present invention may be of any length. By assigning numeric values to a sequence, for example, 
the first residue is 1, the second residue is 2, etc., an algorithm defining all primers can be 
proposed: 

n to n + y 

where n is an integer from 1 to the last number of the sequence and y is the length of the primer 
minus one, where n + y does not exceed the last number of the sequence. Thus, for a 25-mer, 
the probes correspond to bases 1 to 25, 2 to 26, 3 to 27 ... and so on. For a 30-mer, the probes 
correspond to bases 1 to 30, 2 to 31, 3 to 32 ... and so on. For a 35-mer, the probes correspond 
to bases 1 to 35, 2 to 36, 3 to 37 ... and so on. 

It will also be understood that this invention is not limited to the particular nucleic acid 
and amino acid sequences of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ 
ID NO:46, SEQ ID NO:47 or SEQ ID NO:48. Recombinant vectors and isolated DNA segments 
may therefore variously include these coding regions themselves, coding regions bearing 
selected alterations or modifications in the basic coding region, or they may encode larger 
polypeptides that nevertheless include such coding regions or may encode biologically 
functional equivalent proteins or peptides that have variant amino acids sequences. 

The DNA segments of the present invention encompass biologically functional 
equivalent TLR-4 proteins. Such sequences may arise as a consequence of codon redundancy 
and functional equivalency that are known to occur naturally within nucleic acid sequences and 
the proteins thus encoded. Alternatively, functionally equivalent proteins or peptides may be 
created via the application of recombinant DNA technology, in which changes in the protein 
structure may be engineered, based on considerations of the properties of the amino acids being 
exchanged. Changes designed by man may be introduced through the application of site- 
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directed mutagenesis techniques, e.g., to introduce improvements to the antigenicity of the 
protein or to test mutants in order to examine transcription, elongation or Tat binding activity at 
the molecular level. 

5 One may also prepare fusion proteins and peptides, e.g., where the TLR-4 protein coding 

regions are aligned within the same expression unit with other proteins or peptides having 
desired functions, such as for purification or immunodetection purposes {e.g., proteins that may 
be purified by affinity chromatography and enzyme label coding regions, respectively). 

10 Encompassed by the invention are DNA segments encoding relatively small peptides, 

such as, for example, peptides of from about 15 to about 50 amino acids in length, and more 
£ ^ preferably, of from about 15 to about 30 amino acids in length; and also larger polypeptides up 
% B to and including proteins corresponding to the full-length sequences set forth in SEQ ID NO:2, 
2 SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:98 and SEQ ID NO:99 6. 
^ 15 

^ It is proposed that the DNA segments of the present invention may be employed for a 

i: variety of applications. For example, a particularly useful application concerns the recombinant 

% production of the individual subunits or proteins or peptides whose structure is derived from that 
?f of the subunits, or in the recombinant production of the holoenzyme following co-expression of 
*3 20 the two subunits. Additionally, the TLR-4 -encoding DNA segments of the present invention 
,=CT can also be used in the preparation of nucleic acid probes or primers, which can, for example, be 

used in the identification and cloning of TLR-4 genes or related genomic sequences, or in the 

study of subunit(s) expression, and the like. 

25 E. Immunologic Detection Methods 

In one embodiment, the diagnostic approach will be immunologic. The reagents will 
include antibodies to the TLR-4 and TLR-4 mutants, or fragments thereof, and will further include 
reagents capable of detecting an antibody immunoreactive with an such compound. Detection 
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methods include, but are not limited to ELISA, RIA and immunoblots, as discussed elsewhere in 
the specification. 



Antibodies against TLR-4 and TLR-4 mutants isolated using the methodology described 
5 will be useful in the present invention, primarily in assays for the detection of of individuals 
suspectible to Gram-negative infection. Means for preparing and characterizing antibodies are 
well known in the art (see, e.g., Antibodies: A Laboratory Manual, Cold Spring Harbor 
Laboratory, 1988; incorporated herein by reference). 

10 The methods for generating monoclonal antibodies (MAbs) generally begin along the 

same lines as those for preparing polyclonal antibodies. Briefly, a polyclonal antibody is 
_ prepared by immunizing an animal with an immunogenic composition in accordance with the 
■3 present invention and collecting antisera from that immunized animal. A wide range of animal 
;2 species can be used for the production of antisera. Typically the animal used for production of 
!jj 15 anti-antisera is a rabbit, a mouse, a rat, a hamster, a guinea pig or a goat. 

Immunogenic compositions of the invention include TLR-4, TLR-4 mutants or fragments 
y and the like. As is well known in the art, a given composition may vary in its immunogenicity. 
M It is often necessary therefore to boost the host immune system, as may be achieved by coupling 
20 a peptide or polypeptide immunogen to a carrier. Exemplary and preferred carriers are keyhole 
^ limpet hemocyanin (KLH) and bovine serum albumin (BSA). Other albumins such as 
ovalbumin, mouse serum albumin or rabbit serum albumin can also be used as carriers. Means 
for conjugating a compound to a carrier protein are well known in the art and include 
glutaraldehyde, m-maleimidobenzoyl-N-hydroxysuccinimide ester, carbodiimyde and bis- 
25 biazotized benzidine. 

As is also well known in the art, the immunogenicity of a particular immunogen 
composition can be enhanced by the use of non-specific stimulators of the immune response, 
known as adjuvants. Exemplary and preferred adjuvants include complete Freund's adjuvant (a 
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non-specific stimulator of the immune response containing killed Mycobacterium tuberculosis), 
incomplete Freund's adjuvants and aluminum hydroxide adjuvant. 

The amount of immunogen composition used in the production of polyclonal antibodies 
5 varies upon the nature of the immunogen as well as the animal used for immunization. A variety 
of routes can be used to administer the immunogen (subcutaneous, intramuscular, intradermal, 
intravenous and intraperitoneal). The production of polyclonal antibodies may be monitored by 
sampling blood of the immunized animal at various points following immunization. A second, 
booster injection, also may be given. The process of boosting and titering is repeated until a 
10 suitable titer is achieved. When a desired level of immunogenicity is obtained, the immunized 
animal can be bled and the serum isolated and stored, and/or the animal can be used to generate 
MAbs. 

% MAbs may be readily prepared through use of well-known techniques, such as those 

15 exemplified in U.S. Patent 4,196,265, incorporated herein by reference. Typically, this 
m technique involves immunizing a suitable animal with a selected immunogen composition, e.g., 
j ss a purified or partially purified protein, polypeptide or peptide. The immunizing composition is 
y administered in a manner effective to stimulate antibody producing cells. Rodents such as mice 
|=£ and rats are preferred animals, however, the use of rabbit, sheep frog cells is also possible. The 
?JJ 20 use of rats may provide certain advantages, but mice are preferred, with the BALB/c mouse 
^ being most preferred as this is most routinely used and generally gives a higher percentage of 
stable fusions. 

Following immunization, somatic cells with the potential for producing antibodies, 
25 specifically B lymphocytes (B cells), are selected for use in the MAb generating protocol. These 
cells may be obtained from biopsied spleens, tonsils or lymph nodes, or from a peripheral blood 
sample. Spleen cells and peripheral blood cells are preferred, the former because they are a rich 
source of antibody-producing cells that are in the dividing plasmablast stage, and the latter 
because peripheral blood is easily accessible. Often, a panel of animals will have been 
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immunized and the spleen of animal with the highest antibody titer will be removed and the 
spleen lymphocytes obtained by homogenizing the spleen with a syringe. Typically, a spleen 
from an immunized mouse contains approximately 5x10 to 2 x 10 lymphocytes. 



5 The antibody-producing B lymphocytes from the immunized animal are then fused with 

cells of an immortal myeloma cell, generally one of the same species as the animal that was 
immunized. Myeloma cell lines suited for use in hybridoma-producing fusion procedures 
preferably are non-antibody-producing, have high fusion efficiency, and enzyme deficiencies 
that render then incapable of growing in certain selective media which support the growth of 
10 only the desired fused cells (hybridomas). 

Any one of a number of myeloma cells may be used, as are known to those of skill in the 
art (Campbell, pp. 75-83, 1984). For example, where the immunized animal is a mouse, one 
may use P3-X63/Ag8, X63-Ag8.653, NSl/l.Ag 4 1, Sp210-Agl4, FO, NSO/U, MPC-11, 
15 MPC11-X45-GTG 1.7 and S194/5XX0 Bui; for rats, one may use R210.RCY3, Y3-Ag 1.2.3, 
IR983F and 4B210; and U-266, GM1500-GRG2, LICR-LON-HMy2 and UC729-6 are all useful 
in connection with human cell fusions. 



Methods for generating hybrids of antibody-producing spleen or lymph node cells and 
20 myeloma cells usually comprise mixing somatic cells with myeloma cells in a 2:1 proportion, 
though the proportion may vary from about 20:1 to about 1:1, respectively, in the presence of an 
agent or agents (chemical or electrical) that promote the fusion of cell membranes. Fusion 
methods using Sendai virus have been described by Kohler and Milstein (1975; 1976), and those 
using polyethylene glycol (PEG), such as 37% (v/v) PEG, by Gefter et al (1977). The use of 
25 electrically induced fusion methods is also appropriate. 



Fusion procedures usually produce viable hybrids at low frequencies, about 1 X 10* 6 to 

Q 

1 X 10" . However, this does not pose a problem, as the viable, fused hybrids are differentiated 
from the parental, unfused cells (particularly the unfused myeloma cells that would normally 
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continue to divide indefinitely) by culturing in a selective medium. The selective medium is 
generally one that contains an agent that blocks the de novo synthesis of nucleotides in the tissue 
culture media. Exemplary and preferred agents are aminopterin, methotrexate, and azaserine. 
Aminopterin and methotrexate block de novo synthesis of both purines and pyrimidines, whereas 
5 azaserine blocks only purine synthesis. Where aminopterin or methotrexate is used, the media is 
supplemented with hypoxanthine and thymidine as a source of nucleotides (HAT medium). 
Where azaserine is used, the media is supplemented with hypoxanthine. 

The preferred selection medium is HAT. Only cells capable of operating nucleotide 
10 salvage pathways are able to survive in HAT medium. The myeloma cells are defective in key 
enzymes of the salvage pathway, e.g., hypoxanthine phosphoribosyl transferase (HPRT), and 
they cannot survive. The B cells can operate this pathway, but they have a limited life span in 
ig culture and generally die within about two weeks. Therefore, the only cells that can survive in 

i ts : 

^ the selective media are those hybrids formed from myeloma and B cells. 
| 15 

vq This culturing provides a population of hybridomas from which specific hybridomas are 

" selected. Typically, selection of hybridomas is performed by culturing the cells by single-clone 

Q dilution in microtiter plates, followed by testing the individual clonal supernatants (after about 

\& two to three weeks) for the desired reactivity. The assay should be sensitive, simple and rapid. 

; t: 20 Radioimmunoassays, enzyme immunoassays, cytotoxicity assays, plaque assays, dot 

«3 immunobinding assays, and the like, may be used. 

Where one desires to generate an antibody with defined activity, one would generally 
screen the candidate hybridomas to identify those hybridomas that produce antibodies that have 
25 the desired inhibitory or stimulatory properties. Any selected hybridomas would then be serially 
diluted and cloned into individual antibody-producing cell lines, which clones can then be 
propagated indefinitely to provide MAbs. The cell lines may be exploited for MAb production 
in two basic ways. A sample of the hybridoma can be injected (often into the peritoneal cavity) 
into a histocompatible animal of the type that was used to provide the somatic and myeloma 
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cells for the original fusion. The injected animal develops tumors secreting the specific 
monoclonal antibody produced by the fused cell hybrid. The body fluids of the animal, such as 
serum or ascites fluid, can then be tapped to provide MAbs in high concentration. The 
individual cell lines could also be cultured in vitro, where the MAbs are naturally secreted into 
the culture medium from which they can be readily obtained in high concentrations. MAbs 
produced by either means may be further purified, if desired, using filtration, centrifugation and 
various chromatographic methods such as HPLC or affinity chromatography. 

Certain aspects of the present invention relates to the detection of TLR-4 and mutants 
thereof. One method of detecting such compounds uses immunoassays for agents of the present 
invention. Antibodies that recognize TLR-4 or TLR-4 mutants of the present invention are 
contemplated to be useful in these immunoassays. 

Immunoassays encompassed by the present invention include, but are not limited to those 
described in U.S. Patent No. 4,367,110 (double monoclonal antibody sandwich assay) and U.S. 
Patent No. 4,452,901 (western blot). Other assays include immunoprecipitation of labeled 
ligands and immunocytochemistry, both in vitro and in vivo. 

Immunoassays, in their most simple and direct sense, are binding assays. Certain 
preferred immunoassays are the various types of enzyme linked immunosorbent assays (ELISAs) 
and radioimmunoassays (RIA) known in the art. Immunohistochemical detection using tissue 
sections is also particularly useful. 

In one exemplary ELISA, the antibodies of the invention are immobilized onto a selected 
surface exhibiting protein affinity, such as a well in a polystyrene microtiter plate. Then, a test 
composition suspected of containing the desired antigen, such as a clinical sample, is added to 
the wells. After binding and washing to remove non-specifically bound immune complexes, the 
bound antigen may be detected. Detection is generally achieved by the addition of another 
antibody, specific for the desired antigen, that is linked to a detectable label. This type of ELISA 
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is a simple "sandwich ELISA". Detection may also be achieved by the addition of a second 
antibody specific for the desired antigen, followed by the addition of a third antibody that has 
binding affinity for the second antibody, with the third antibody being linked to a detectable 
label. 

5 

In another exemplary ELISA, the samples suspected of containing the desired antigen are 
immobilized onto the well surface and then contacted with the antibodies of the invention. After 
binding and appropriate washing, the bound immune complexes are detected. Where the initial 
antigen specific antibodies are linked to a detectable label, the immune complexes may be 
10 detected directly. Again, the immune complexes may be detected using a second antibody that 
has binding affinity for the first antigen specific antibody, with the second antibody being linked 
to a detectable label. 



■J Competition ELISAs are also possible in which test samples compete for binding with 

yl 15 known amounts of labeled antigens or antibodies. The amount of reactive species in the 
g3 unknown sample is determined by mixing the sample with the known labeled species before or 
~ 2 during incubation with coated wells. The presence of reactive species in the sample acts to 
0 reduce the amount of labeled species available for binding to the well and thus reduces the 
U ultimate signal. 
5 20 

sQ Irrespective of the format employed, ELISAs have certain features in common, such as 

coating, incubating or binding, washing to remove non-specifically bound species, and detecting 
the bound immune complexes. These are described as below. 

25 Antigen or antibodies may also be linked to a solid support, such as in the form of plate, 

beads, dipstick, membrane or column matrix, and the sample to be analyzed applied to the 
immobilized antigen or antibody. In coating a plate with either antigen or antibody, one will 
generally incubate the wells of the plate with a solution of the antigen or antibody, either 
overnight or for a specified period. The wells of the plate will then be washed to remove 
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incompletely adsorbed material. Any remaining available surfaces of the wells are then "coated" 
with a nonspecific protein that is antigenically neutral with regard to the test antisera. These 
include bovine serum albumin (BSA), casein and solutions of milk powder. The coating allows 
for blocking of nonspecific adsorption sites on the immobilizing surface and thus reduces the 
5 background caused by nonspecific binding of antisera onto the surface. 

In ELISAs, it is probably more customary to use a secondary or tertiary detection means 
rather than a direct procedure. Thus, after binding of the antigen or antibody to the well, coating 
with a non-reactive material to reduce background, and washing to remove unbound material, 
10 the immobilizing surface is contacted with the clinical or biological sample to be tested under 
conditions effective to allow immune complex (antigen/antibody) formation. Detection of the 
immune complex then requires a labeled secondary binding ligand or antibody, or a secondary 
iS binding ligand or antibody in conjunction with a labeled tertiary antibody or third binding 
^ ligand. 

m 15 

ay 

ng "Under conditions effective to allow immune complex (antigen/antibody) formation" 

ws means that the conditions preferably include diluting the antigens and antibodies with solutions 

51 

□ such as BSA, bovine gamma globulin (BGG) and phosphate buffered saline (PBS)/Tween. 
|^ These added agents also tend to assist in the reduction of nonspecific background. 
5 20 

*D The suitable conditions also mean that the incubation is at a temperature and for a period 

of time sufficient to allow effective binding. Incubation steps are typically from about 1 to 2 to 
4 hours, at temperatures preferably on the order of 25° to 27°C, or may be overnight at about 
4°C or so. 

25 

Following all incubation steps in an ELISA, the contacted surface is washed so as to 
remove non-complexed material. Washing often includes washing with a solution of 
PBS/Tween, or borate buffer. Following the formation of specific immune complexes between 
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the test sample and the originally bound material, and subsequent washing, the occurrence of 
even minute amounts of immune complexes may be determined. 

To provide a detecting means, the second or third antibody will have an associated label 
5 to allow detection. Preferably, this will be an enzyme that will generate color development upon 
incubating with an appropriate chromogenic substrate. Thus, for example, one will desire to 
contact and incubate the first or second immune complex with a urease, glucose oxidase, 
alkaline phosphatase or hydrogen peroxidase-conjugated antibody for a period of time and under 
conditions that favor the development of further immune complex formation, e.g., incubation for 
10 2 hours at room temperature in a PBS-containing solution such as PBS-Tween. 

After incubation with the labeled antibody, and subsequent to washing to remove 
!fS unbound material, the amount of label is quantified, e.g., by incubation with a chromogenic 

3 .. : 

^ substrate such as urea and bromocresol purple or 

(ft 15 2,2'-azino-di-(3-ethyl-benzthiazoline-6-sulfonic acid (ABTS) and H2O2, in the case of 

*Jj peroxidase as the enzyme label. Quantification is then achieved by measuring the degree of 
color generation, e.g., using a visible spectra spectrophotometer. 

s; 

3^ Alternatively, the label may be a chemiluminescent one. The use of such labels is 

^ 20 described in U.S. Patent Nos. 5,310,687, 5,238,808 and 5,221,605. 

iQ 

■ iFi 

Assays for TLR-4 and TLR-4 mutants of the present invention also can determine 
normal/abnormal tissue distribution for diagnostic purposes. Methods for in vitro and in situ 
analysis are well known and involve assessing binding of antigen-specific antibodies to tissues, 
25 cells or cell extracts. These are conventional techniques well within the grasp of those skilled in 
the art. For example, the antibodies of the present invention may be used in conjunction with 
both fresh-frozen and formalin-fixed, paraffin-embedded tissue blocks prepared for study by 
immunohistochemistry (IHC). Each tissue block may consist of 50 mg of residual "pulverized" 
prostate tumor. The method of preparing tissue blocks from these particulate specimens has 
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been successfully used in previous IHC studies of various prognostic factors, e.g., in breast 
cancer, and is well known to those of skill in the art. (Abbondanzo et aL, 1990; Allred et aL, 
1990; Brown etal, 1990) 



5 Briefly, frozen-sections may be prepared by rehydrating 50 ng of frozen pulverized tumor 

at room temperature in PBS in small plastic capsules; pelleting the particles by centrifiigation; 
resuspending them in a viscous embedding medium (OCT); inverting the capsule and pelleting 
again by centrifiigation; snap-freezing in -70°C isopentane; cutting the plastic capsule and 
removing the frozen cylinder of tissue; securing the tissue cylinder on a cryostat microtome 
10 chuck; and cutting 25-50 serial sections containing an average of about 500 remarkably intact 
tumor cells. 

,S Permanent-sections may be prepared by a similar method involving rehydration of the 50 

S *5J nig sample in a plastic microfuge tube; pelleting; resuspending in 10% formalin for 4 hours 
m 15 fixation; washing/pelleting; resuspending in warm 2.5% agar; pelleting; cooling in ice water to 
harden the agar; removing the tissue/agar block from the tube; infiltrating and embedding the 
!3?! block in paraffin; and cutting up to 50 serial permanent sections. 

Issl 
sjijj 

P F. Nucleic Acid Detection 

■H 20 In addition to their use in directing the expression of the TLR-4 protein, the nucleic acid 

*S sequences disclosed herein also have a variety of other uses. For example, they also have utility 
as probes or primers in nucleic acid hybridization embodiments. 

i. Hybridization 

25 The use of a hybridization probe of between 20 and 100 nucleotides in length allows the 

formation of a duplex molecule that is both stable and selective. Molecules having complementary 
sequences over stretches greater than 20 bases in length are generally preferred, in order to increase 
stability and selectivity of the hybrid, and thereby improve the quality and degree of particular 
hybrid molecules obtained. One will generally prefer to design nucleic acid molecules having 
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stretches of 20 to 30 nucleotides, or even longer where desired. Such fragments may be readily 
prepared by, for example, directly synthesizing the fragment by chemical means or by introducing 
selected sequences into recombinant vectors for recombinant production. These chemical means 
can include PCR ™ technology of U.S. Patent 4,603,102 (herein incorporated by reference) or by 
introducing selected sequences into recombinant vectors for recombinant production. 

Accordingly, the nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of genes or RNAs or to provide 
primers for amplification of DNA or RNA from tissues. Depending on the application envisioned, 
one will desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. 

For certain applications, for example, substitution of nucleotides by site-directed 
mutagenesis, it is appreciated that lower stringency conditions are required. Under these 
conditions, hybridization may occur even though the sequences of probe and target strand are not 
perfectly complementary, but are mismatched at one or more positions. Conditions may be 
rendered less stringent by increasing salt concentration and decreasing temperature. For example, a 
low stringency hybridization conditions for the present invention of hybridization in 35% 
formamide, 5x Denhardts' solution, 5x SSC, 25 mM sodium phosphate, 0.1% SDS and 100 
|ug/ml denatured salmon sperm DNA or E. coli DNA at 42°C for 16 hours followed by 
sequential washes with 0.8x SSC, 0.1% SDS at 55°C allows for cross-species hybridization to 
homologous proteins to occur. Thus, hybridization conditions can be readily manipulated 
depending on the desired results. 

In other embodiments, hybridization may be achieved under conditions of, for example, 
50% formamide, 5x Denhardts' solution, 5x SSC, 25 mM sodium phosphate, 0.1% SDS and 100 
|ig/ml of denatured salmon sperm DNA at 42°C for 16 hours followed by 1 hour sequential 
washes with O.lx SSC, 0.1% SDS solution at 60°C to remove the desired amount of background 
signal. 
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In certain embodiments, it will be advantageous to employ nucleic acid sequences of the 
present invention in combination with an appropriate means, such as a label, for determining 
hybridization. A wide variety of appropriate indicator means are known in the art, including 
5 fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, which are capable of 
being detected. In preferred embodiments, one may desire to employ a fluorescent label or an 
enzyme tag such as urease, alkaline phosphatase or peroxidase, instead of radioactive or other 
environmentally undesirable reagents. In the case of enzyme tags, colorimetric indicator substrates 
are known that can be employed to provide a detection means visible to the human eye or 
10 spectrophotometrically, to identify specific hybridization with complementary nucleic acid- 
containing samples. 



io In general, it is envisioned that the hybridization probes described herein will be useful 

both as reagents in solution hybridization, as in PCR™, for detection of expression of 

*u 

15 corresponding genes, as well as in embodiments employing a solid phase. In embodiments 

fg involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise affixed to a selected 

if? 

™ : matrix or surface. This fixed, single-stranded nucleic acid is then subjected to hybridization with 
*j selected probes under desired conditions. The selected conditions will depend on the particular 
circumstances based on the particular criteria required (depending, for example, on the G+C 

m 

20 content, type of target nucleic acid, source of nucleic acid, size of hybridization probe, etc.). 
^ Following washing of the hybridized surface to remove non-specifically bound probe molecules, 
hybridization is detected, or even quantified, by means of the label. 



ii. Amplification and PCR m 
25 Nucleic acid used as a template for amplification is isolated from cells contained in the 

biological sample, according to standard methodologies (Sambrook et aL, 1989). The nucleic 
acid may be genomic DNA or fractionated or whole cell RNA. Where RNA is used, it may be 
desired to convert the RNA to a cDNA. 
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Pairs of primers that selectively hybridize to nucleic acids corresponding to a TLR-4 
protein or a mutant thereof are contacted with the isolated nucleic acid under conditions that 
permit selective hybridization. The term "primer", as defined herein, is meant to encompass any 
nucleic acid that is capable of priming the synthesis of a nascent nucleic acid in a template- 
5 dependent process. Typically, primers are oligonucleotides from ten to twenty base pairs in 
length, but longer sequences can be employed. Primers may be provided in double-stranded or 
single-stranded form, although the single-stranded form is preferred. 

Once hybridized, the nucleic acid:primer complex is contacted with one or more 
10 enzymes that facilitate template-dependent nucleic acid synthesis. Multiple rounds of 
amplification, also referred to as "cycles," are conducted until a sufficient amount of 
amplification product is produced. 

^ Next, the amplification product is detected. In certain applications, the detection may be 

l» 15 performed by visual means. Alternatively, the detection may involve indirect identification of 
=g the product via chemiluminescence, radioactive scintigraphy of incorporated radiolabel or 
" 5 fluorescent label or even via a system using electrical or thermal impulse signals (Affymax 
S technology). 

'I? 20 A number of template dependent processes are available to amplify the marker sequences 

ifi present in a given template sample. One of the best known amplification methods is the 

polymerase chain reaction (referred to as PCR™) which is described in detail in U.S. Patent Nos. 

4,683,195, 4,683,202 and 4,800,159, and each incorporated herein by reference in entirety. 

25 Briefly, in PCR™, two primer sequences are prepared that are complementary to regions 

on opposite complementary strands of the marker sequence. An excess of deoxynucleoside 
triphosphates are added to a reaction mixture along with a DNA polymerase, e.g., Taq 
polymerase. If the marker sequence is present in a sample, the primers will bind to the marker 
and the polymerase will cause the primers to be extended along the marker sequence by adding 
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on nucleotides. By raising and lowering the temperature of the reaction mixture, the extended 
primers will dissociate from the marker to form reaction products, excess primers will bind to 
the marker and to the reaction products and the process is repeated. 



5 A reverse transcriptase PCR™ (RT-PCR™) amplification procedure may be performed in 

order to quantify the amount of mRNA amplified or to prepare cDNA from the desired mRNA. 
Methods of reverse transcribing RNA into cDNA are well known and described in Sambrook et 
ai, 1989. Alternative methods for reverse transcription utilize thermostable, RNA-dependent 
DNA polymerases. These methods are described in WO 90/07641, filed December 21, 1990, 
10 incorporated herein by reference. Polymerase chain reaction methodologies are well known in 
the art. 

!?j Another method for amplification is the ligase chain reaction ("LCR"), disclosed in EPA 

fej No. 320 308, incorporated herein by reference in its entirety. In LCR, two complementary probe 
yl 15 pairs are prepared, and in the presence of the target sequence, each pair will bind to opposite 
ijj complementary strands of the target such that they abut. In the presence of a ligase, the two 
'~ ~ probe pairs will link to form a single unit. By temperature cycling, as in PCR™, bound ligated 
Q units dissociate from the target and then serve as "target sequences" for ligation of excess probe 
\& pairs. U.S. Patent 4,883,750 describes a method similar to LCR for binding probe pairs to a 
|tj 20 target sequence. 

Qbeta Replicase, described in PCT Application No. PCT/US87/00880, incorporated 
herein by reference, may also be used as still another amplification method in the present 
invention. In this method, a replicative sequence of RNA that has a region complementary to 
25 that of a target is added to a sample in the presence of an RNA polymerase. The polymerase will 
copy the replicative sequence that can then be detected. 

An isothermal amplification method, in which restriction endonucleases and ligases are 
used to achieve the amplification of target molecules that contain nucleotide 5'-[alpha-thio]- 
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triphosphates in one strand of a restriction site may also be useful in the amplification of nucleic 
acids in the present invention. 

Strand Displacement Amplification (SDA) is another method of carrying out isothermal 
5 amplification of nucleic acids which involves multiple rounds of strand displacement and 
synthesis, i.e., nick translation. A similar method, called Repair Chain Reaction (RCR), 
involves annealing several probes throughout a region targeted for amplification, followed by a 
repair reaction in which only two of the four bases are present. The other two bases can be 
added as biotinylated derivatives for easy detection. A similar approach is used in SDA. Target 
10 specific sequences can also be detected using a cyclic probe reaction (CPR). In CPR, a probe 
having 3' and 5' sequences of non-specific DNA and a middle sequence of specific RNA is 
hybridized to DNA that is present in a sample. Upon hybridization, the reaction is treated with 
jTj RNase H, and the products of the probe identified as distinctive products that are released after 
L ^ digestion. The original template is annealed to another cycling probe and the reaction is 
If! 15 repeated. 

Ul Still another amplification methods described in GB Application No. 2 202 328, and in 

Q PCT Application No. PCT/US 89/0 1025, each of which is incorporated herein by reference in its 
1^ entirety, may be used in accordance with the present invention. In the former application, 
J 2 20 "modified" primers are used in a PCR™-like, template- and enzyme-dependent synthesis. The 
?B primers may be modified by labeling with a capture moiety (e.g., biotin) and/or a detector moiety 
(e.g., enzyme). In the latter application, an excess of labeled probes are added to a sample. In 
the presence of the target sequence, the probe binds and is cleaved catalytically. After cleavage, 
the target sequence is released intact to be bound by excess probe. Cleavage of the labeled probe 
25 signals the presence of the target sequence. 

Other nucleic acid amplification procedures include transcription-based amplification 
systems (TAS), including nucleic acid sequence based amplification (NASBA) and 3SR 
Gingeras etai, PCT Application WO 88/10315, incorporated herein by reference. In NASBA, 
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the nucleic acids can be prepared for amplification by standard phenol/chloroform extraction, 
heat denaturation of a clinical sample, treatment with lysis buffer and minispin columns for 
isolation of DNA and RNA or guanidinium chloride extraction of RNA. These amplification 
techniques involve annealing a primer which has target specific sequences. Following 
5 polymerization, DNA/RNA hybrids are digested with RNase H while double stranded DNA 
molecules are heat denatured again. In either case the single stranded DNA is made fully double 
stranded by addition of second target specific primer, followed by polymerization. The double- 
stranded DNA molecules are then multiply transcribed by an RNA polymerase such as T7 or 
SP6. In an isothermal cyclic reaction, the RNA's are reverse transcribed into single stranded 
10 DNA, which is then converted to double stranded DNA, and then transcribed once again with an 
RNA polymerase such as T7 or SP6. The resulting products, whether truncated or complete, 
indicate target specific sequences. 

^ Davey et ai, EPA No. 329 822 (incorporated herein by reference in its entirety) disclose 

111 15 a nucleic acid amplification process involving cyclically synthesizing single-stranded RNA 
J ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in accordance with 
lJi the present invention. The ssRNA is a template for a first primer oligonucleotide, which is 
□ elongated by reverse transcriptase (RNA-dependent DNA polymerase). The RNA is then 
l2 removed from the resulting DNA:RNA duplex by the action of ribonuclease H (RNase H, an 
u 2 20 RNase specific for RNA in duplex with either DNA or RNA). The resultant ssDNA is a 
*0 template for a second primer, which also includes the sequences of an RNA polymerase 
' promoter (exemplified by T7 RNA polymerase) 5' to its homology to the template. This primer 
is then extended by DNA polymerase (exemplified by the large "Klenow" fragment of E. coli 
DNA polymerase I), resulting in a double-stranded DNA ("dsDNA") molecule, having a 
25 sequence identical to that of the original RNA between the primers and having additionally, at 
one end, a promoter sequence. This promoter sequence can be used by the appropriate RNA 
polymerase to make many RNA copies of the DNA. These copies can then re-enter the cycle 
leading to very swift amplification. With proper choice of enzymes, this amplification can be 
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done isothermally without addition of enzymes at each cycle. Because of the cyclical nature of 
this process, the starting sequence can be chosen to be in the form of either DNA or RNA. 

Miller etai, PCT Application WO 89/06700 (incorporated herein by reference in its 
entirety) disclose a nucleic acid sequence amplification scheme based on the hybridization of a 
promoter/primer sequence to a target single-stranded DNA ("ssDNA") followed by transcription 
of many RNA copies of the sequence. This scheme is not cyclic, i.e., new templates are not 
produced from the resultant RNA transcripts. Other amplification methods include "RACE" and 
"one-sided PCR" (Frohman, 1990, incorporated by reference). 

Methods based on ligation of two (or more) oligonucleotides in the presence of nucleic 
acid having the sequence of the resulting "di-oligonucleotide", thereby amplifying the 
di-oligonucleotide, may also be used in the amplification step of the present invention. 

Following any amplification, it may be desirable to separate the amplification product 
from the template and the excess primer for the purpose of determining whether specific 
amplification has occurred. In one embodiment, amplification products are separated by 
agarose, agarose-acrylamide or polyacrylamide gel electrophoresis using standard methods. See 
Sambrook etal, 1989. 

Alternatively, chromatographic techniques may be employed to effect separation. There 
are many kinds of chromatography which may be used in the present invention: adsorption, 
partition, ion-exchange and molecular sieve, and many specialized techniques for using them 
including column, paper, thin-layer and gas chromatography. 

Amplification products must be visualized in order to confirm amplification of the 
marker sequences. One typical visualization method involves staining of a gel with ethidium 
bromide and visualization under UV light. Alternatively, if the amplification products are 
integrally labeled with radio- or fluorometrically-labeled nucleotides, the amplification products 
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can then be exposed to x-ray film or visualized under the appropriate stimulating spectra, 
following separation. 

In one embodiment, visualization is achieved indirectly. Following separation of 
5 amplification products, a labeled, nucleic acid probe is brought into contact with the amplified 
marker sequence. The probe preferably is conjugated to a chromophore but may be 
radiolabeled. In another embodiment, the probe is conjugated to a binding partner, such as an 
antibody or biotin, and the other member of the binding pair carries a detectable moiety. 

10 In one embodiment, detection is by Southern blotting and hybridization with a labeled 

probe. The techniques involved in Southern blotting are well known to those of skill in the art 
and can be found in many standard books on molecular protocols. See Sambrook et ai, 1989. 

'ft Briefly, amplification products are separated by gel electrophoresis. The gel is then contacted 
with a membrane, such as nitrocellulose, permitting transfer of the nucleic acid and non-covalent 

y! 15 binding. Subsequently, the membrane is incubated with a chromophore-conjugated probe that is 

m capable of hybridizing with a target amplification product. Detection is by exposure of the 

y 1 membrane to x-ray film or ion-emitting detection devices. 

Q 

if! 

{Z One example of the foregoing is described in U.S. Patent No. 5,279,721, incorporated by 

20 reference herein, which discloses an apparatus and method for the automated electrophoresis and 
%y transfer of nucleic acids. The apparatus permits electrophoresis and blotting without external 

manipulation of the gel and is ideally suited to carrying out methods according to the present 

invention. 

25 All the essential materials and reagents required for detecting TLR-4 protein markers in a 

biological sample may be assembled together in a kit. This generally will comprise preselected 
primers for specific markers. Also included may be enzymes suitable for amplifying nucleic 
acids including various polymerases (RT, Taq, etc), deoxynucleotides and buffers to provide the 
necessary reaction mixture for amplification, 
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Such kits generally will comprise, in suitable means, distinct containers for each 
individual reagent and enzyme as well as for each marker primer pair. Preferred pairs of primers 
for amplifying nucleic acids are selected to amplify the sequences specified in SEQ ID NO:l, 
5 SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID 
NO:48. 

In another embodiment, such kits will comprise hybridization probes specific for TLR-4 
protein chosen from a group including nucleic acids corresponding to the sequences specified in 
10 SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 
or SEQ ID NO:48. Such kits generally will comprise, in suitable means, distinct containers for 
each individual reagent and enzyme as well as for each marker hybridization probe. 

^IJ 

;S iiu Other Assays 

^ 15 Other methods for genetic screening to accurately detect genetic changes which may be 

m caused by disease, such as bacterial infections that alter normal cellular production and 

lit 

processing, in genomic DNA, cDNA or RNA samples may be employed, depending on the 
^ specific situation. 

: : : 

'~q 20 For example, one method of screening for genetic variation is based on RNase cleavage 

^ of base pair mismatches in RNA/DNA and RNA/RNA heteroduplexes. As used herein, the term 
"mismatch" is defined as a region of one or more unpaired or mispaired nucleotides in a double- 
stranded RNA/RNA, RNA/DNA or DNA/DNA molecule. This definition thus includes 
mismatches due to insertion/deletion mutations, as well as single and multiple base point 
25 mutations. 

U.S. Patent No. 4,946,773 describes an RNase A mismatch cleavage assay that involves 
annealing single-stranded DNA or RNA test samples to an RNA probe, and subsequent 
treatment of the nucleic acid duplexes with RNase A. After the RNase cleavage reaction, the 
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RNase is inactivated by proteolytic digestion and organic extraction, and the cleavage products 
are denatured by heating and analyzed by electrophoresis on denaturing polyacrylamide gels. 
For the detection of mismatches, the single-stranded products of the RNase A treatment, 
electrophoretically separated according to size, are compared to similarly treated control 
5 duplexes. Samples containing smaller fragments (cleavage products) not seen in the control 
duplex are scored as +. 

Currently available RNase mismatch cleavage assays, including those performed 
according to U.S. Patent No. 4,946,773, require the use of radiolabeled RNA probes. Myers and 
10 Maniatis in U.S. Patent No. 4,946,773 describe the detection of base pair mismatches using 
RNase A. Other investigators have described the use of E. coli enzyme, RNase I, in mismatch 
assays. Because it has broader cleavage specificity than RNase A, RNase I would be a desirable 
*jL enzyme to employ in the detection of base pair mismatches if components can be found to 
W decrease the extent of non-specific cleavage and increase the frequency of cleavage of 
m 15 mismatches. The use of RNase I for mismatch detection is described in literature from Promega 
p Biotech. Promega markets a kit containing RNase I that is shown in their literature to cleave 
™I three out of four known mismatches, provided the enzyme level is sufficiently high. 
O 

The RNase protection assay was first used to detect and map the ends of specific mRNA 
&j 20 targets in solution. The assay relies on being able to easily generate high specific activity 
bO radiolabeled RNA probes complementary to the mRNA of interest by in vitro transcription. 
Originally, the templates for in vitro transcription were recombinant plasmids containing 
bacteriophage promoters. The probes are mixed with total cellular RNA samples to permit 
hybridization to their complementary targets, then the mixture is treated with RNase to degrade 
25 excess unhybridized probe. Also, as originally intended, the RNase used is specific for single- 
stranded RNA, so that hybridized double-stranded probe is protected from degradation. After 
inactivation and removal of the RNase, the protected probe (which is proportional in amount to 
the amount of target mRNA that was present) is recovered and analyzed on a polyacrylamide 
gel. 
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The RNase Protection assay was adapted for detection of single base mutations. In this 
type of RNase A mismatch cleavage assay, radiolabeled RNA probes transcribed in vitro from 
wild type sequences, are hybridized to complementary target regions derived from test samples. 
5 The test target generally comprises DNA (either genomic DNA or DNA amplified by cloning in 
plasmids or by PGR™), although RNA targets (endogenous mRNA) have occasionally been 
used. If single nucleotide (or greater) sequence differences occur between the hybridized probe 
and target, the resulting disruption in Watson-Crick hydrogen bonding at that position 
("mismatch") can be recognized and cleaved in some cases by single-strand specific 
10 ribonuclease. To date, RNase A has been used almost exclusively for cleavage of single-base 
mismatches, although RNase I has recently been shown as useful also for mismatch cleavage. 
There are recent descriptions of using the MutS protein and other DNA-repair enzymes for 
^ detection of single-base mismatches. 

p 15 G. Recombinant Vectors, Host Cells and Expression 

J2 Recombinant vectors form important further aspects of the present invention. The term 

( * s "expression vector or construct" means any type of genetic construct containing a nucleic acid 

O coding for a gene product in which part or all of the nucleic acid encoding sequence is capable of 

s: : 

^ being transcribed. The transcript may be translated into a protein, but it need not be. Thus, in 
\H 20 certain embodiments, expression includes both transcription of a gene and translation of a RNA 
into a gene product. In other embodiments, expression only includes transcription of the nucleic 
acid, for example, to generate antisense constructs. 

Particularly useful vectors are contemplated to be those vectors in which the coding 
25 portion of the DNA segment, whether encoding a full length protein or smaller peptide, is 
positioned under the transcriptional control of a promoter. A "promoter" refers to a DNA 
sequence recognized by the synthetic machinery of the cell, or introduced synthetic machinery, 
required to initiate the specific transcription of a gene. The phrases "operatively positioned", 
"under control" or "under transcriptional control" means that the promoter is in the correct 
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location and orientation in relation to the nucleic acid to control RNA polymerase initiation and 
expression of the gene. 

The promoter may be in the form of the promoter that is naturally associated with a TLR- 
5 4 protein gene, as may be obtained by isolating the 5' non-coding sequences located upstream of 
the coding segment or exon, for example, using recombinant cloning and/or PCR™ technology, 
in connection with the compositions disclosed herein (PCR™ technology is disclosed in U.S. 
Patent 4,683,202 and U.S. Patent 4,682,195, each incorporated herein by reference). 
Additionally, the toll-like receptors from Drosophila are well known to those of skill in the art. 
10 The promoter regions of these nucleic acids may be useful herein. 

In other embodiments, it is contemplated that certain advantages will be gained by 
=fi positioning the coding DNA segment under the control of a recombinant, or heterologous, 
;S promoter. As used herein, a recombinant or heterologous promoter is intended to refer to a 
ffj 15 promoter that is not normally associated with a TLR-4 protein gene in its natural environment. 
m Such promoters may include promoters normally associated with other genes, and/or promoters 
isolated from any other bacterial, viral, eukaryotic, or mammalian cell. 

M Naturally, it will be important to employ a promoter that effectively directs the 

?« 20 expression of the DNA segment in the cell type, organism, or even animal, chosen for 
^ expression. The use of promoter and cell type combinations for protein expression is generally 
known to those of skill in the art of molecular biology, for example, see Sambrook et al. (1989), 
incorporated herein by reference. The promoters employed may be constitutive, or inducible, 
and can be used under the appropriate conditions to direct high level expression of the 
25 introduced DNA segment, such as is advantageous in the large-scale production of recombinant 
proteins or peptides. 

At least one module in a promoter functions to position the start site for RNA synthesis. 
The best known example of this is the TATA box, but in some promoters lacking a TATA box, 
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such as the promoter for the mammalian terminal deoxynucleotidyl transferase gene and the 
promoter for the S V40 late genes, a discrete element overlying the start site itself helps to fix the 
place of initiation. 



5 Additional promoter elements regulate the frequency of transcriptional initiation. 

Typically, these are located in the region 30-1 10 bp upstream of the start site, although a number 
of promoters have been shown to contain functional elements downstream of the start site as 
well. The spacing between promoter elements frequently is flexible, so that promoter function is 
preserved when elements are inverted or moved relative to one another. In the thymidine kinase 
10 (tk) promoter, the spacing between promoter elements can be increased to 50 bp apart before 
activity begins to decline. Depending on the promoter, it appears that individual elements can 
function either co-operatively or independently to activate transcription. 

The particular promoter that is employed to control the expression of a nucleic acid is not 
n\ 15 believed to be critical, so long as it is capable of expressing the nucleic acid in the targeted cell. 
m Thus, where a human cell is targeted, it is preferable to position the nucleic acid coding region 
adjacent to and under the control of a promoter that is capable of being expressed in a human 
cell. Generally speaking, such a promoter might include either a human or viral promoter. 
Preferred promoters include those derived from HSV, including the HNFla promoter. Another 

■tf 20 preferred embodiment is the tetracycline controlled promoter. 

^y 

y3 

In various other embodiments, the human cytomegalovirus (CMV) immediate early gene 
promoter, the Simian virus 40 (SV40) early promoter and the Rous sarcoma virus long terminal 
repeat can be used to obtain high-level expression of transgenes. The use of other viral or 
25 mammalian cellular or bacterial phage promoters which are well-known in the art to achieve 
expression of a transgene is contemplated as well, provided that the levels of expression are 
sufficient for a given purpose. The following tables list several elements/promoters which may 
be employed, in the context of the present invention, to regulate the expression of TLR-4 protein 
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or subunit gene. This list is not intended to be exhaustive of all the possible elements involved 
in the promotion of transgene expression but, merely, to be exemplary thereof. 

Enhancers were originally detected as genetic elements that increased transcription from 
5 a promoter located at a distant position on the same molecule of DNA. This ability to act over a 
large distance had little precedent in classic studies of prokaryotic transcriptional regulation. 
Subsequent work showed that regions of DNA with enhancer activity are organized much like 
promoters. That is, they are composed of many individual elements, each of which binds to one 
or more transcriptional proteins. 

10 

The basic distinction between enhancers and promoters is operational. An enhancer 
region as a whole must be able to stimulate transcription at a distance; this need not be true of a 
promoter region or its component elements. On the other hand, a promoter must have one or 
^ more elements that direct initiation of RNA synthesis at a particular site and in a particular 
ffl 15 orientation, whereas enhancers lack these specificities. Promoters and enhancers are often 
2 overlapping and contiguous, often seeming to have a very similar modular organization. 

s:sa 
si i 

y Additionally any promoter/enhancer combination (as per the Eukaryotic Promoter Data 

'Q Base EPDB) could also be used to drive expression of a transgene. Use of a T3, T7 or SP6 
J *J 20 cytoplasmic expression system is another possible embodiment. Eukaryotic cells can support 
•D cytoplasmic transcription from certain bacterial promoters if the appropriate bacterial 

polymerase is provided, either as part of the delivery complex or as an additional genetic 

expression construct. 
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PROMOTER TABLE 



PROMOTER 


REFERENCES 


Immunoglobulin Heavy Chain 


Hanerji etal, 1983; Gilles etal, 1983; Grosschedl and Baltimore, 
1985; Imicretal, 1987; Weinberger etal, 1988; Kiledjian etal, 
1988;Porton etal, 1990 


Immunoglobulin Light Chain 


Queen and Baltimore, 1983; Picard and Schaffner, 1984 


T-Cell Receptor 


Luria et al, 1987, Winoto and Baltimore, 1989; Redondo et al, 1990 


HLA DQ a and DQ 6 


Sullivan and Peterlin, 1987 


B-Interferon 


Goodbourn et al, 1986; Fujita et al, 1987; Goodbourn and Maniatis, 
1985 


Interleukin-2 


Greene et al, 1989 


Interleukin-2 Receptor 


Greene a/., 1989; Lin et al, 1 990 


MHC Class II 5 


Koch era/., 1989 


MHC Class II HLA-DRa 


Sherman a/., 1989 


6-Actin 


Kawamoto a/., 1988; Ng etal, 1989 


Muscle Creatine Kinase 


Jaynes etal, 1988; Horhck and Benfield, 1989; Johnson etal, 1989a 


Prealbumin (Transthyretin) 


Costa etal, 1988 


Elastase / 


umitz €r at, iyo/ 


Metallothionein 


r^ann £f at., ivo /, uulotta ana Hamer, lyay 


Collagenase 


rinkert etal., iyiw, Angel etaL, iyo/ 


Albumin Gene 


Pinkertef a/., 1987, Tronche etal, 1989, 1990 


ot-Fetoprotein 


Godboutefa/., 1988 


T-Globin 


Bodine and Ley, 1987; Perez-Stable and Constantini, 1990 


B-Globin 


Trudel and Constantini, 1987 


e-fos 


Cohen etal, 1987 


c-HA-ras 


Triesman, 1986; Deschamps etal, 1985 


Insulin 


Edlund etal, 1985 


Neural Cell Adhesion Molecule 
(NCAM) 


Hirsch etal, 1990 


O-I-Antitrypain 


Latimer etal, 1990 


H2B (TH2B) Histone 


Hwang etal, 1990 
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GRP78) 

Rat Growth Hormone 
Human Serum Amyloid A (SAA) 
Troponin I (TN I) 
Platelet-Derived Growth Factor 
Duchenne Muscular Dystrophy 
SV40 



Polyoma 



Retroviruses 



Papilloma Vims 



Hepatitis B Virus 
Human Immunodeficiency Virus 



Cytomegalovirus 



Edbrookeeffl/., 1989 



Yutzey etal, 1989 
Pech etal,\%9 
Kla mut etal, 1990 

Banerii^U981;Moreau et al, 1981; Sleigh and Lockett, 1985; 
Firak and Subramanian, 1986; Herr and Clarke, 1986; Imbra and 
Karin, 1986; Kadesch and Berg, 1986; Wang and Calame 1986; 
Qnde k et al, 1987; Kuhl et al, 1987 Schaffner et al, 1988 

Swartzendruber and Lehman, 1975; Vasseur * *'™^« 
al 1980 1981; TyndelUM/., 1981; Dandoloef^/., 1983 deVUhers 
etal, 1984; Hen et al, 1986; Satake et al, 1988; Campbell and 
Vill arreal, 1988 

KrieglerandBotchan, 1982, 1983; Levinson et al 1982; Mcgr tf 
al 1983 1984a,b, 1988; Bosze et al, 1986; Miksicek et al, 1986, 
Celander'and Haseltine, 1987; Thiesen et al , 1988; Celander et al, 
1988; Cho i etai, 1988; Reisman and Rotter, 1989 
Campo et al, 1983; Lusky et al, 1983; Spandidos and Wilkie 1983; 
Spalholz etai, 1985; Lusky and Botchan, 1986; Cnpe etal 1987, 
Gloss et al, 1987; Hirochika et al, 1987, Stephens and Hentschel, 
1987; Glue etal, 1988 
Bulla and Siddiqui, 1986; Jameel and Siddiqui, 1986; Shaul and Ben- 
Lew , 1987; Spandau and Lee, 1988; Vannice and Levinson, 1988 

Muesing^/., 1987; Hauber and Cullan, m^kobovits et aL 
V988' Feng and Holland, 1988;Takebe etal, 1988; Rowen etal, 
mi Berkhout et al, 1989; Laspia et al, 1989; Sharp and 
Marciniak, 1989; Braddock et al, 1989 



Gibbon Ape Leukemia Virus 



Weber et al, 1984; Boshart et al, 1985; Foecking and Hofstetter, 
1986 ■ 



Holbrooke al, 1987;Quinn etal, 1989 
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ENHANCER TABLE 







Inducer 


References 




MT II 


Phorbol Ester (TFA) 
Heavy metals 


Palmiter et aL, 1982; Haslinger and Karin, 
1985; Searle etal, 1985; Stuart et aL, 
1985; Imagawa et aL, 1987; Karin ®, 
1987; Angel etaL, 1987b; McNeall etaL, 
1989 




MMTV (mouse mammary 
tumor virus) 


Glucocorticoids 


Huang etaL, 1981; Lee etaL, 1981; 
Majors and Varmus, 1983; Chandler et aL, 
1983; Lee etaL, 1984; Fontae/a/., 1985; 
Sakai etaL, 1986 




6-Interferon 


poly(rI)X 
poly(rc) 


Tavernier a/., 1983 




Adenovirus 5 E2 


Ela 


Imperiale and Nevins, 1984 




Collagenase 


Phorbol Ester (TP A) 


Angle era/., 1987a 




Stromelysin 


Phorbol Ester (TPA) 


Angle era/., 1987b 


! 5 : 


SV40 


Phorbol Ester (TFA) 


Angel era/., 1987b 


'•O 

jS=~ 


Munne MX Gene 


Interferon, Newcastle Disease 
Virus 




s™ 


/THY70 „ „ „ 

GRP78 Gene 


A23187 


t-» j ^ . j i Ann 

Resendez era/., 1988 




ct-2-Macroglobulin 


TT 

IL-6 


Kunz etai, 1989 


u 


Vimentin 


Serum 


Rittling etai, 1989 




MHC Class I Gene H-2kb 


Interferon 


Blanar era/., 1989 




HSP70 


Ela, SV40 Large T Antigen 


Taylor et aL, 1989; Taylor and Kingston, 
I990a,b 




Proliferin 


Phorbol Ester-TPA 


Mordacq and Linzer, 1989 




Tumor Necrosis Factor 


FMA 


Hensel etaL, 1989 




Thyroid Stimulating Hormone 
a Gene 


Thyroid Hormone 


Chatterjee etaL, 1989 
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Turning to the expression of the TLR-4 proteins of the present invention, once a suitable 
clone or clones have been obtained, whether they be cDNA based or genomic, one may proceed 
to prepare an expression system. The engineering of DNA segment(s) for expression in a 
prokaryotic or eukaryotic system may be performed by techniques generally known to those of 
5 skill in recombinant expression. It is believed that virtually any expression system may be 
employed in the expression of the proteins of the present invention. 

Both cDNA and genomic sequences are suitable for eukaryotic expression, as the host 
cell will generally process the genomic transcripts to yield functional mRNA for translation into 

10 protein. Generally speaking, it may be more convenient to employ as the recombinant gene a 
cDNA version of the gene. It is believed that the use of a cDNA version will provide advantages 
in that the size of the gene will generally be much smaller and more readily employed to 
transfect the targeted cell than will a genomic gene, which will typically be up to an order of 
magnitude larger than the cDNA gene. However, the possibility of employing a genomic 

15 version of a particular gene where desired is not excluded. 

In expression, one will typically include a polyadenylation signal to effect proper 
polyadenylation of the transcript. The nature of the polyadenylation signal is not believed to be 
crucial to the successful practice of the invention, and any such sequence may be employed. 
20 Preferred embodiments include the SV40 polyadenylation signal and the bovine growth 
hormone polyadenylation signal, convenient and known to function well in various target cells. 
Also contemplated as an element of the expression cassette is a terminator. These elements can 
serve to enhance message levels and to minimize read through from the cassette into other 
sequences. 



25 



A specific initiation signal also may be required for efficient translation of coding 
sequences. These signals include the ATG initiation codon and adjacent sequences. Exogenous 
translation^ control signals, including the ATG initiation codon, may need to be provided. One 
of ordinary skill in the art would readily be capable of determining this and providing the 
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necessary signals. It is well known that the initiation codon must be "in-frame" with the reading 
frame of the desired coding sequence to ensure translation of the entire insert. The exogenous 
translation^ control signals and initiation codons can be either natural or synthetic. The 
efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer 
5 elements. 

As used herein, the terms "engineered" and "recombinant" cells are intended to refer to a 
cell into which an exogenous DNA segment or gene, such as a cDNA or gene encoding a TLR-4 
protein has been introduced. Therefore, engineered cells are distinguishable from naturally 
10 occurring cells which do not contain a recombinantly introduced exogenous DNA segment or 
gene. Engineered cells are thus cells having a gene or genes introduced through the hand of 
man. Recombinant cells include those having an introduced cDNA or genomic gene, and also 
]s include genes positioned adjacent to a promoter not naturally associated with the particular 
w introduced gene. 

m 15 

To express a recombinant TLR-4 protein, whether mutant or wild-type, in accordance 
^ with the present invention one would prepare an expression vector that comprises a TLR-4 
□ protein-encoding nucleic acid under the control of one or more promoters. To bring a coding 
p sequence "under the control of* a promoter, one positions the 5' end of the transcription 
«j 20 initiation site of the transcriptional reading frame generally between about 1 and about 50 
if) nucleotides "downstream" of (i.e., 3' of) the chosen promoter. The "upstream" promoter 
stimulates transcription of the DNA and promotes expression of the encoded recombinant 
protein. This is the meaning of "recombinant expression" in this context. 

25 Many standard techniques are available to construct expression vectors containing the 

appropriate nucleic acids and transcriptional/translational control sequences in order to achieve 
protein or peptide expression in a variety of host-expression systems. Cell types available for 
expression include, but are not limited to, bacteria, such as E, coli and B. subtilis transformed 
with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors. 
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Certain examples of prokaryotic hosts are E. coli strain RR1, E. coli LE392, £. coliB, 
E. coli X 1776 (ATCC No. 31537) as well as E. coli W31 10 (F-, lambda-, prototrophic, ATCC 
No. 273325); bacilli such as Bacillus subtilis\ and other enterobacteriaceae such as Salmonella 
5 typhimuriwn, Serratia marcescens, and various Pseudomonas species. 

In general, plasmid vectors containing replicon and control sequences which are derived 
from species compatible with the host cell are used in connection with these hosts. The vector 
ordinarily carries a replication site, as well as marking sequences which are capable of providing 
10 phenotypic selection in transformed cells. For example, E. coli is often transformed using 
pBR322, a plasmid derived from an E. coli species. pBR322 contains genes for ampicillin and 
tetracycline resistance and thus provides easy means for identifying transformed cells. The pBR 
■jj plasmid, or other microbial plasmid or phage must also contain, or be modified to contain, 

y promoters which can be used by the microbial organism for expression of its own proteins. 

m 15 

J? In addition, phage vectors containing replicon and control sequences that are compatible 

131 with the host microorganism can be used as transforming vectors in connection with these hosts. 

i; TM 

Q For example, the phage lambda GEM -11 may be utilized in making a recombinant phage 
j~f vector which can be used to transform host cells, such as E. coli LE392. 
fj 20 

% r) Further useful vectors include pIN vectors and pGEX vectors, for use in generating 

glutathione S-transferase (GST) soluble fusion proteins for later purification and separation or 
cleavage. Other suitable fusion proteins are those with 6-galactosidase, ubiquitin, the like. 

25 Promoters that are most commonly used in recombinant DNA construction include the b- 

lactamase (penicillinase), lactose and tryptophan (trp) promoter systems. While these are the 
most commonly used, other microbial promoters have been discovered and utilized, and details 
concerning their nucleotide sequences have been published, enabling those of skill in the art to 
ligate them functionally with plasmid vectors. 
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The following details concerning recombinant protein production in bacterial cells, such 
as E. coli, are provided by way of exemplary information on recombinant protein production in 
general, the adaptation of which to a particular recombinant expression system will be known to 
5 those of skill in the art. 



Bacterial cells, for example, E. coli, containing the expression vector are grown in any of 
a number of suitable media, for example, LB. The expression of the recombinant protein may be 
induced, e.g., by adding IPTG to the media or by switching incubation to a higher temperature. 
10 After culturing the bacteria for a further period, generally of between 2 and 24 hours, the cells 
are collected by centrifugation and washed to remove residual media. 



The bacterial cells are then lysed, for example, by disruption in a cell homogenizer and 
!« centrifuged to separate the dense inclusion bodies and cell membranes from the soluble cell 
m 15 components. This centrifugation can be performed under conditions whereby the dense 
inclusion bodies are selectively enriched by incorporation of sugars, such as sucrose, into the 
Isl buffer and centrifugation at a selective speed. 

5-: 

{Z If the recombinant protein is expressed in the inclusion bodies, as is the case in many 

20 instances, these can be washed in any of several solutions to remove some of the contaminating 
=n host proteins, then solubilized in solutions containing high concentrations of urea (e.g. 8M) or 

chaotropic agents such as guanidine hydrochloride in the presence of reducing agents, such as 6- 

mercaptoethanol or DTT (dithiothreitol). 



25 Under some circumstances, it may be advantageous to incubate the protein for several 

hours under conditions suitable for the protein to undergo a refolding process into a 
conformation which more closely resembles that of the native protein. Such conditions 
generally include low protein concentrations, less than 500 mg/ml, low levels of reducing agent, 
concentrations of urea less than 2 M and often the presence of reagents such as a mixture of 
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reduced and oxidized glutathione which facilitate the interchange of disulfide bonds within the 
protein molecule. 

The refolding process can be monitored, for example, by SDS-PAGE, or with antibodies 
specific for the native molecule (which can be obtained from animals vaccinated with the native 
molecule or smaller quantities of recombinant protein). Following refolding, the protein can 
then be purified further and separated from the refolding mixture by chromatography on any of 
several supports including ion exchange resins, gel permeation resins or on a variety of affinity 
columns. 

For expression in Saccharomyces, the plasmid YRp7, for example, is commonly used. 
This plasmid already contains the trp\ gene which provides a selection marker for a mutant strain 
of yeast lacking the ability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1. 
The presence of the trp\ lesion as a characteristic of the yeast host cell genome then provides an 
effective environment for detecting transformation by growth in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors include the promoters for 
3-phosphoglycerate kinase or other glycolytic enzymes, such as enolase, glyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucoses- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, 
phosphoglucose isomerase, and glucokinase. In constructing suitable expression plasmids, the 
termination sequences associated with these genes are also ligated into the expression vector 3' 
of the sequence desired to be expressed to provide polyadenylation of the mRNA and 
termination. 

Other suitable promoters, which have the additional advantage of transcription controlled 
by growth conditions, include the promoter region for alcohol dehydrogenase 2, isocytochrome 
C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, and the 
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aforementioned glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for 
maltose and galactose utilization. 

In addition to microorganisms, cultures of cells derived from multicellular organisms 
5 may also be used as hosts. In principle, any such cell culture is workable, whether from 
vertebrate or invertebrate culture. In addition to mammalian cells, these include insect cell 
systems infected with recombinant virus expression vectors (e.g., baculovirus); and plant cell 
systems infected with recombinant virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression 
10 vectors (e.g., Ti plasmid) containing one or more Toll protein coding sequences. 

In a useful insect system, Autograph californica nuclear polyhidrosis virus (AcNPV) is 

;S used as a vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. The 

■5? TLR-4 protein coding sequences are cloned into non-essential regions (for example the 

IP 15 polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for example the 

li polyhedrin promoter). Successful insertion of the coding sequences results in the inactivation of 

Ul the polyhedrin gene and production of non-occluded recombinant virus (i.e., virus lacking the 

D proteinaceous coat coded for by the polyhedrin gene). These recombinant viruses are then used 

1^ to infect Spodoptera frugiperda cells in which the inserted gene is expressed (e.g., U.S. Patent 

^ 20 No. 4,215,051, Smith, incorporated herein by reference). 

Examples of useful mammalian host cell lines are VERO and HeLa cells, Chinese 
hamster ovary (CHO) cell lines, W138, BHK, COS-7, 293, HepG2, 3T3, RIN and MDCK cell 
lines. In addition, a host cell strain may be chosen that modulates the expression of the inserted 
25 sequences, or modifies and processes the gene product in the specific fashion desired. Such 
modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein products may be 
important for the function of the protein. 
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Dif ,ere„, ho, ceUs have charac.eris.ic and specific mechanisms fo, .he ^H—a, 
processing « ^. - P-n, Approve ceUs lines - hos, sysKn* can he chosen 
.censure the corrcc, modificadon and processing of .he foreign protein expressed. 

Expression vectors for use in mammalian such ceUs ordinal include an origin of 
repl ica.ion ,as necessary,, a promOer !oca«d in frou, of the gene ,o he expressed, aiong - 

L~ seouence, The origin of rephcarion may *. provided ei.her by cons— . e 
vecor .0 ,nc,„de an exogenous ortgin, such as ma, he derived from SV40 or Cher vr,at <e, 
10 ^yoma. Adeno, VSV. BPV, «, or ma, , provided by me hos, ceU chro— 
repLon mechanism. » ft. vector is in.egr.ed in.o .he hos. cell cnromosome, .he lauer 

often sufficient. 

S The promos may he derived from .he genome of mammal ceiis (e,, 

'S , ,i„„ „i„ lu ,. u t .he adenovirus late promoter, me 

A 15 metallothionein promoter) or from mammalran vruses (e.g., 

I vaccinia virus 7.5K promoter,. Further, it is also possihle, attd may be desrrable, to utrl*. 
W plter or control seance, normally ass.ia.ed wi.h .he desired TUM protem gene 
^nence, provided such con.ro, sequences are compatible wi,h .he hos, cell sys.ems. 

A number of vita, based expression sys,ems may be udlized, for exampie, commonly 
nsed promoters are derived from polyoma, Adenovirus 2, - mos. frequency Simian V» « 
(Svl) The early and late promos o, SV40 virus are parfcularly useful because bod, are 
Jd easr,y from the v ta s as a fragrant which aiso contains ,he SV40 v W ongm o 

the viral origin of replication. 

„ cases where an adenovims is used as an expression vec,o,, ,he coding sequences may 
ta ,i g a,ed ,o an adenovirus transcription/ transladon control complex, ,,, the .ate promoter and 
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tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome by 
in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome {e.g., 
region El or E3) will result in a recombinant virus that is viable and capable of expressing TLR- 
4 proteins in infected hosts. 

5 

Specific initiation signals may also be required for efficient translation TLR-4 protein 
coding sequences. These signals include the ATG initiation codon and adjacent sequences. 
Exogenous translational control signals, including the ATG initiation codon, may additionally 
need to be provided. One of ordinary skill in the art would readily be capable of determining 
10 this and providing the necessary signals. It is well known that the initiation codon must be in- 
frame (or in-phase) with the reading frame of the desired coding sequence to ensure translation 
of the entire insert. These exogenous translational control signals and initiation codons can be 
.% of a variety of origins, both natural and synthetic. The efficiency of expression may be enhanced 
^ by the inclusion of appropriate transcription enhancer elements, transcription terminators. 

gi In eukaryotic expression, one will also typically desire to incorporate into the 

transcriptional unit an appropriate polyadenylation site (e.g., 5-AATAAA-3') if one was not 

si 

O contained within the original cloned segment. Typically, the poly-A addition site is placed about 
\2 30 to 2000 nucleotides "downstream" of the termination site of the protein at a position prior to 
u 2 20 transcription termination. 

For long-term, high-yield production of recombinant TLR-4 proteins, stable expression is 
preferred. For example, cell lines that stably express constructs TLR-4 proteins may be 
engineered. Rather than using expression vectors that contain viral origins of replication, host 
25 cells can be transformed with vectors controlled by appropriate expression control elements 
(e.g., promoter, enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of foreign DNA, engineered cells may be allowed 
to grow for 1-2 days in an enriched media, and then are switched to a selective media. The 
selectable marker in the recombinant plasmid confers resistance to the selection and allows cells 
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10 



,„ stably integrate the plasmid i„.o .hei, chromosomes and grow to form foci which in turn can 
be cloned and expanded into cell lines. 

A number of selection systems may be used, including, but no. limited, to the herpes 
staple* virus (HSV) tk, hypoxanthine-guanine phosphotransferase (hgprt) and adenme 
phosphotransferase genes (apt), m tk-, hgprt- or apri- cells, restive., Also, 
an,imen*o>i,e resistance can be used as the basis of se,ec«to„ for * that confers reststance to 
methotrexate; S p, -ha, confers resistance to mycophenolic acid; neo, that confers reststance to 
the aminoglycoside G-41 8; and hygro, that confers resistance to hygromyctn, 

Mima, cells can be propagated in *. in two modes: as non-anchorage dependent cells 
growing in suspension dtroughou, the bulk of the cutare or as anchorage-dependent cells 
string attachment to a solid substrate fo, men propagauon (U, a monolayer type of cel. 



m 15 



growth). 



I'll 

si 

0 

W 20 



Non-anchorage dependent or suspension cultutes from continuous established cel. hnes 
are the most widely used means of iarge scale production o, cells and cel, products. However, 
suspension cultured cells have litnitattons, such as tumorigenic potential and lower protetn 

production than adherent cells. 

Large scale suspension culture of mammalian cells in stirred tanks is a common method 
for production of recombinant proteins. Two suspension culture reactor designs are in w,d= use 
. the sttrred reactor and the airlift «,„, The stirred design has success** teen used on an 

25 aheight-to-diameterratioof 1:1 to3;l. The culture is usually mixed with one or more agrtators, 
tee d on bladed disks or marine propelier patterns. Agitator systems offering less shear forces 
than blades have been described. Agitation may be driven either directly or tndtrecdy by 
magnetically coupled drives. Indirect drives K duce the risk of microbial —nation through 



seals on stirrer shafts. 
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The airlift reactor, also initially described for microbial fermentation and later adapted 
for mammalian culture, relies on a gas stream to both mix and oxygenate the culture. The gas 
stream enters a riser section of the reactor and drives circulation. Gas disengages at the culture 
surface, causing denser liquid free of gas bubbles to travel downward in the downcomer section 
of the reactor. The main advantage of this design is the simplicity and lack of need for 
mechanical mixing. Typically, the height-to-diameter ratio is 10:1. The airlift reactor scales up 
relatively easily, has good mass transfer of gases and generates relatively low shear forces. 

It is contemplated that the TLR-4 protein of the invention may be "overexpressed", i.e., 
expressed in increased levels relative to its natural expression in cells. Such overexpression may 
be assessed by a variety of methods, including radio-labeling and/or protein purification. 
However, simple and direct methods are preferred, for example, those involving SDS/PAGE and 
protein staining or western blotting, followed by quantitative analyses, such as densitometric 
scanning of the resultant gel or blot. A specific increase in the level of the recombinant protein 
or peptide in comparison to the level in natural cells is indicative of overexpression, as is a 
relative abundance of the specific protein in relation to the other proteins produced by the host 
cell and, e.g., visible on a gel. 

H. Kits 

All the essential materials and reagents required for detecting TLR-4 polynucleotides or 
polypeptides may be assembled together in a kit. When the components of the kit are provided 
in one or more liquid solutions, the liquid solution preferably is an aqueous solution, with a 
sterile aqueous solution being particularly preferred. Each component preferably is supplied in a 
separate container. 

For therapeutic uses, a polynucleotide or candidate substance, as identified according to 
the methods disclosed herein, may be formulated into a single or separate pharmaceutically 
acceptable syringeable composition. In this case, the container means may itself be an inhalant, 
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syringe, pipette, eye dropper, or other such like apparatus, from which the formulation may be 
applied to an infected area of the body, injected into an animal, or even applied to and mixed 
with the other components of the kit. 

5 The components of these kits may also be provided in dried or lyophilized forms. When 

reagents or components are provided as a dried form, reconstitution generally is by the addition 
of a suitable solvent. It is envisioned that the solvent also may be provided in another container 
means. The kits of the invention may also include an instruction sheet defining administration 
of active compounds or explaining assays for detecting TLR-4 or TLR-4 mutants in samples. 

10 

The kits of the present invention also will typically include a means for containing the 
vials in close confinement for commercial sale such as, e.g., injection or blow-molded plastic 
"S containers into which the desired vials are retained. Irrespective of the number or type of 

W containers, the kits of the invention also may comprise, or be packaged with, an instrument for 

m 15 assisting with the injection/administration or placement of the ultimate complex composition 
within the body of an animal. Such an instrument may be an inhalant, syringe, pipette, forceps, 
*H measured spoon, eye dropper or any such medically approved delivery vehicle. Other 

Q instrumentation includes devices that permit the reading or monitoring of reactions in vitro. 

i{= 20 L Biological Functional Equivalents 

ig As will be understood by those of skill in the art, modification and changes may be made 

in the structure of the TLR-4 protein and subunits and still obtain a molecule having like or 
otherwise desirable characteristics. For example, certain amino acids may be substituted for 
other amino acids in a protein structure without appreciable loss of interactive binding capacity. 
25 Since it is the interactive capacity and nature of a protein that defines that protein's biological 
functional activity, certain amino acid sequence substitutions can be made in a protein sequence 
(or, of course, its underlying DNA coding sequence) and nevertheless obtain a protein with like 
(agonistic) properties. It is thus contemplated that various changes may be made in the sequence 
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, a ivina mM without appreciable loss of their biological 
of TLR-4 proteins or subunits (or underlying DNA) w.tnou W 

utility or activity. 

5 with counting (e.g., antagonistic, propert.es. Th,s relevan P 
whic „ TLR-4 rogues withon, endotoxin recognition act,v,ty are contemplated 
inhibiting the secretion of TNF. 

■ , • .1™., it is also well understood by the skilled artisan that, 
Tn tprms of functional equivalents, it is aiso wc 

moto le - s.iU ks»L in a molecnle - °J ^ 

Bioiogicaii, — ^— PeP— * . LI — - 

on n f the amino adds may be substituted, m y 
certain, not most or all, of the amino t 

pro.eins/peptides - different subs.tot.ons may easuy be made and 

the invention. 

„ „ „e» — d dra. where certain residues are shown .0 be particniaHy 
active site of an enzyme or .0 maintain protein function. 

25 iike. An anaiys. of me s.ze shape an » ^ ^ ^ ^ 

11::: based i — - — ~ - — * 
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alanine, glycine and serine; and phenylalanine, tryptophan and tyrosine; are defined herein as 
biologically functional equivalents. 

To effect more quantitative changes, the hydropathic index of amino acids may be 
5 considered. Each amino acid has been assigned a hydropathic index on the basis of their 
hydrophobicity and charge characteristics, these are: isoleucine (+4.5); valine (+4.2); leucine 
(+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine (+1.9); alanine (+1.8); glycine 
(-0.4); threonine (-0.7); serine (-0.8); tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine 
(-3.2); glutamate (-3.5); glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and 
10 arginine (-4.5). 



The importance of the hydropathic amino acid index in conferring interactive biological 

J function on a protein is generally understood in the art (Kyte & Doolittle, 1982, incorporated 

^ herein by reference). It is known that certain amino acids may be substituted for other amino 

u\ 15 acids having a similar hydropathic index or score and still retain a similar biological activity. In 

m making changes based upon the hydropathic index, the substitution of amino acids whose 

— hydropathic indices are within ±2 is preferred, those which are within ±1 are particularly 

O preferred, and those within ±0.5 are even more particularly preferred. 

: : E 

•e— 

~J 20 It is also understood in the art that the substitution of like amino acids can be made 

?fi effectively on the basis of hydrophilicity, particularly where the biological functional equivalent 

protein or peptide thereby created is intended for use in immunological embodiments, as in the 
present case. U.S. Patent 4,554,101, incorporated herein by reference, states that the greatest 
local average hydrophilicity of a protein, as governed by the hydrophilicity of its adjacent amino 
25 acids, correlates with its immunogenicity and antigenicity, i.e. with a biological property of the 
protein. 



As detailed in U.S. Patent 4,554,101, the following hydrophilicity values have been 
assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); glutamate 
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(+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine (-0.4); 
proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine (-1.5); 
leucine (-1.8); isoleucine (-1.8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 

In making changes based upon similar hydrophilicity values, the substitution of amino 
acids whose hydrophilicity values are within ±2 is preferred, those which are within ±1 are 
particularly preferred, and those within ±0.5 are even more particularly preferred. 

While discussion has focused on functionally equivalent polypeptides arising from amino 
acid changes, it will be appreciated that these changes may be effected by alteration of the 
encoding DNA; taking into consideration also that the genetic code is degenerate and that two or 
more codons may code for the same amino acid. A table of amino acids and their codons is 
presented hereinabove for use in such embodiments, as well as for other uses, such as in the 
design of probes and primers and the like. 

i. Mutagenesis 

Site-specific mutagenesis is a technique useful in the preparation of individual peptides, 
or biologically functional equivalent proteins or peptides, through specific mutagenesis of the 
underlying DNA. The technique further provides a ready ability to prepare and test sequence 
variants, incorporating one or more of the foregoing considerations, by introducing one or more 
nucleotide sequence changes into the DNA. Site-specific mutagenesis allows the production of 
mutants through the use of specific oligonucleotide sequences which encode the DNA sequence 
of the desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on both 
sides of the deletion junction being traversed. Typically, a primer of about 17 to 25 nucleotides 
in length is preferred, with about 5 to 10 residues on both sides of the junction of the sequence 
being altered. 
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In general, the technique of site-specific mutagenesis is well known in the art. As will be 
appreciated, the technique typically employs a bacteriophage vector that exists in both a single 
stranded and double stranded form. Typical vectors useful in site-directed mutagenesis include 
vectors such as the M 13 phage. These phage vectors are commercially available and their use is 
generally well known to those skilled in the art. Double stranded plasmids are also routinely 
employed in site directed mutagenesis, which eliminates the step of transferring the gene of 
interest from a phage to a plasmid. 

In general, site-directed mutagenesis is performed by first obtaining a single-stranded 
vector, or melting of two strands of a double stranded vector which includes within its sequence 
a DNA sequence encoding the desired protein. An oligonucleotide primer bearing the desired 
mutated sequence is synthetically prepared. This primer is then annealed with the single- 
stranded DNA preparation, and subjected to DNA polymerizing enzymes such as E. coli 
polymerase I Klenow fragment, in order to complete the synthesis of the mutation-bearing 
strand. Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is then used 
to transform appropriate cells, such as £. coli cells, and clones are selected that include 
recombinant vectors bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected gene using site-directed mutagenesis 
is provided as a means of producing potentially useful species and is not meant to be limiting, as 
there are other ways in which sequence variants of genes may be obtained. For example, 
recombinant vectors encoding the desired gene may be treated with mutagenic agents, such as 
hydroxylamine, to obtain sequence variants. 

J. Inhibitors, Stimulators and Screening Assays 

In still further embodiments, the present invention provides methods for identifying new 
TLR-4 inhibitory or stimulatory compounds, which may be termed as "candidate substances." It 
is contemplated that such screening techniques will prove useful in the general identification of 
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any compound that will serve the purpose of inhibiting or stimulating TLR-4 directed signaling 
of TNF secretion, and in preferred embodiments, will provide candidate compounds. 

It is further contemplated that useful compounds in this regard will in no way be limited 
to proteinaceous or peptidyl compounds. In fact, it may prove to be the case that the most useful 
pharmacological compounds for identification through application of the screening assays will 
be non-peptidyl in nature and, e.g., which will serve to inhibit TNF secretion through a tight 
binding or other chemical interaction. Candidate substances may be obtained from libraries of 
synthetic chemicals, or from natural samples, such as rain forest and marine samples. 

L Assay Formats 
The present invention provides methods of screening for modulators of LPS mediated 
response by monitoring the standard activity profile of TLR-4 in the presence and absence of the 
candidate substance and comparing such results. It is contemplated that this screening technique 
will prove useful in the general identification of a compound that will serve the purpose of 
promoting, augmenting or increasing the activity of TLR-4 of a macrophage cell. Such 
compounds will be useful in the treatment of various bacterial infections. 

In these embodiments, the present invention is directed to a method for determining the 
ability of a candidate substance to stimulate the wild-type TLR-4 of cells that either naturally 
express TLR-4 or have been engineered to express TLR-4 as described herein. The method 
including generally the steps of: 

(i) providing a cell expressing a TLR-4 polypeptide; 

(ii) determining the activity of said TLR-4 polypeptide; and 

(iii) contacting said cell with a candidate substance; and 

(iv) comparing the TLR-4 activity of the cell in step (iii) with the TLR-4 
activity observed when said candidate substance is not added, 

-78- 



A: 225917(4%BH01!.DOC) 



10 



3_J 



25 



wherein an alteration in the activity indicates that said candidate substance is a modulator of said 
apoptotic activity. 

To identify a candidate substance as being capable of stimulating TLR-4 in the assay 
above, one would measure or determine the activity in the absence of the added candidate 
substance. One would then add the candidate substance to the cell and determine the activity in 
the presence of the candidate substance. A candidate substance which increases the activity or 
capacity relative to activity observed in its absence is indicative of a candidate substance with 
stimulatory capability. 



In particular embodiments, any compound that stimulates the production of IFN or 
related cytokines and mediates the inflammatory response to LPS or LPS containing moieties 
(e.g., Gram negative bacteria). As stated above, a "candidate substance" refers to any molecule 
that is capable of modulating the activity of TLR-4. The candidate substance may be a protein 
15 or fragment thereof, a small molecule inhibitor, or even a nucleic acid molecule. Accordingly, 
the active compounds may include fragments or parts of naturally-occurring compounds or may 
be found as active combinations of known compounds which are otherwise inactive. 



Accordingly, the present invention provides screening assays to identify agents which 
~] 20 stimulate a cellular immune and/or response, it is proposed that compounds isolated from natural 
\Q sources, such as animals, bacteria, fungi, plant sources, including leaves and bark, and marine 

samples may be assayed as candidates for the presence of potentially useful pharmaceutical 
agents. It will be understood that the pharmaceutical agents to be screened could also be derived 
or synthesized from chemical compositions or man-made compounds. 



Thus, it is understood that the candidate substance identified by the present invention 
may be polypeptide, polynucleotide, small molecule inhibitors or any other compounds that may 
be designed through rational drug design starting from known stimulators of immune and/or 
inflammatory response. For example, Barrett et ai, describe peptides and compounds that bind 
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the DL-1 receptor (U.S Patent 5,786,331, specifically incorporated herein by reference) and U.S. 
Patent 5,508,262 describes an IL-1 receptor agonist. It may be that, as TLR-4 is an analogue of 
the IL-1 receptor, agonists of IL-1 may also be agonists of TLR-4. Alternatively, known 
modulators of EL-1 receptors may prove to be a useful starting point in a rational drug design 
5 strategy that will yield experimentally, therapeutically or clinically relevant compounds that 
modulate the activity of TLR-4 and the immune response that TLR-4 mediates. 

The candidate screening assays are simple to set up and perform. Thus, in assaying for a 
candidate substance, after obtaining a cell expressing functional TLR-4, one will admix a 
10 candidate substance with the cell, under conditions which would allow measurable TNF 
secretion to occur. In this fashion, one can measure the ability of the candidate substance to 
stimulate the TNF secretory response of the cell in the absence of the candidate substance. One 
% would then measure the response in the presence of the candidate substance and determine the 

effect of the candidate substance. 

i'n 

•aw 

m is 

"Effective amounts" in certain circumstances are those amounts effective to reproducibly 
'* 1 stimulate the TNF secretory (inflammatory) response from the cell in comparison to their normal 

O levels. Compounds that achieve significant appropriate changes in activity will be used. 

*2 20 Significant changes in inflammatory response, e.g, y as measured TNF production, 

\Q splenocyte activity and the like are represented by an increase/decrease in the response of at least 

about 30%-40%, and most preferably, by changes of at least about 50%, with higher values of 

course being possible. 

25 It will, of course, be understood that all the screening methods of the present invention 

are useful in themselves notwithstanding the fact that effective candidates may not be found. The 
invention provides methods for screening for such candidates, not solely methods of finding 
them. 
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it Rational Drug Design 
The goal of rational drug design is to produce structural analogs of biologically active 
polypeptides or compounds with which they interact (agonists, antagonists, inhibitors, binding 
partners, etc.). By creating such analogs, it is possible to fashion drugs which are more active or 
5 stable than the natural molecules, which have different susceptibility to alteration or which may 
affect the function of various other molecules. In one approach, one would generate a three- 
dimensional structure for TLR-4 or a fragment thereof. This could be accomplished by x-ray 
crystallograph, computer modeling or by a combination of both approaches. An alternative 
approach, "alanine scan," involves the random replacement of residues throughout molecule 
10 with alanine, and the resulting affect on function determined. 



It also is possible to isolate a specific antibody, selected by a functional assay, and then 
% solve its crystal structure. In principle, this approach yields a pharmacore upon which 

l^j subsequent drug design can be based. It is possible to bypass protein crystallograph altogether 

|T1 15 by generating anti-idiotypic antibodies to a functional, pharmacologically active antibody. As a 
]S mirror image of a mirror image, the binding site of anti-idiotype would be expected to be an 

^ analog of the original antigen. The anti-idiotype could then be used to identify and isolate 

i: 

Q peptides from banks of chemically- or biologically-produced peptides. Selected peptides would 

{t then serve as the pharmacore. Anti-idiotypes may be generated using the methods described 

^ 20 herein for producing antibodies, using an antibody as the antigen. 

Thus, one may design drugs which have activity as stimulators, inhibitors, agonists, 
antagonists of TLR-4 or molecules affected by TLR-4 function. Such rational drug design may 
start with lead compounds already known to those of skill in the art. By virtue of the availability 
25 of cloned TLR-4 sequences, sufficient amounts of these proteins can be produced to perform 
crystallographic studies. In addition, knowledge of the polypeptide sequences permits computer 
employed predictions of structure-function relationships. Particularly useful agents that may be 
found by the present invention will be those agents that stimulate the TLR-4 receptor response to 
LPS and therefore increase the immune attack against LPS. Such agents may bind to the TLR-4. 
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K. Formulations and Routes for Administration to Patients 

Where clinical applications are contemplated, it will be necessary to prepare the 
expression vectors or candidate substances of the present invention as pharmaceutical 
5 compositions, i.e., in a form appropriate for in vivo applications. Generally, this will entail 
preparing compositions that are essentially free of pyrogens, as well as other impurities that 
could be harmful to humans or animals. 



One will generally desire to employ appropriate salts and buffers to render delivery 
10 vectors stable and allow for uptake by target cells. Buffers also will be employed when 
recombinant cells are introduced into a patient. Aqueous compositions of the present invention 
comprise an effective amount of the vector to cells, dissolved or dispersed in a pharmaceutically 
; ~ acceptable carrier or aqueous medium. Such compositions also are referred to as inocula. The 

W phrase "pharmaceutically or pharmacologically acceptable" refer to molecular entities and 

m 15 compositions that do not produce adverse, allergic, or other untoward reactions when 
]p administered to an animal or a human. As used herein, "pharmaceutically acceptable carrier" 

^ includes any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 

J! 

O isotonic and absorption delaying agents and the like. The use of such media and agents for 

{2 pharmaceutically active substances is well know in the art. Except insofar as any conventional 

'2 20 media or agent is incompatible with the vectors or cells of the present invention, its use in 
iJ3 therapeutic compositions is contemplated. Supplementary active ingredients also can be 

incorporated into the compositions. 



The active compositions of the present invention include classic pharmaceutical 
25 preparations. Administration of these compositions according to the present invention will be 
via any common route so long as the target tissue is available via that route. This includes oral, 
nasal, buccal, rectal, vaginal or topical Alternatively, administration may be by orthotopic, 
intradermal, subcutaneous, intramuscular, intraperitoneal or intravenous injection. Such 
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compositions would normally be administered as pharmaceutical^ acceptable compositions, 
described supra. 

The active compounds may be administered via any suitable route, including parenterally 
5 or by injection. Solutions of the active compounds as free base or pharmacologically acceptable 
salts can be prepared in water suitably mixed with a surfactant, such as hydroxypropylcellulose. 
Dispersions also can be prepared in glycerol, liquid polyethylene glycols, and mixtures thereof 
and in oils. Under ordinary conditions of storage and use, these preparations contain a 
preservative to prevent the growth of microorganisms. 

10 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or 
dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions 
■S or dispersions. In all cases the form must be sterile and must be fluid to the extent that easy 

*W syringability exists. It must be stable under the conditions of manufacture and storage and must 

m 15 be preserved against the contaminating action of microorganisms, such as bacteria and fungi. 
% The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 

polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), 
p suitable mixtures thereof, and vegetable oils. The proper fluidity can be maintained, for 

j~* example, by the use of a coating, such as lecithin, by the maintenance of the required particle 

i{| 20 size in the case of dispersion and by the use of surfactants. The prevention of the action of 
; : q microorganisms can be brought about by various antibacterial an antifungal agents, for example, 

parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars or sodium chloride. Prolonged 
absorption of the injectable compositions can be brought about by the use in the compositions of 
25 agents delaying absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in the 
required amount in the appropriate solvent with various of the other ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are prepared by 
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incorporating the various sterilized active ingredients into a sterile vehicle which contains the 
basic dispersion medium and the required other ingredients from those enumerated above. In the 
case of sterile powders for the preparation of sterile injectable solutions, the preferred methods 
of preparation are vacuum-drying and freeze-drying techniques which yield a powder of the 
5 active ingredient plus any additional desired ingredient from a previously sterile-filtered solution 
thereof. 

As used herein, "pharmaceutical^ acceptable carrier" includes any and all solvents, 
dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying 
10 agents and the like. The use of such media and agents for pharmaceutical active substances is 
well known in the art. Except insofar as any conventional media or agent is incompatible with 
the active ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
^ active ingredients also can be incorporated into the compositions. 

\T) 15 For oral administration the polypeptides of the present invention may be incorporated 

with excipients and used in the form of non-ingestible mouthwashes and dentifrices. A 
mouthwash may be prepared incorporating the active ingredient in the required amount in an 

G appropriate solvent, such as a sodium borate solution (Dobell's Solution). Alternatively, the 

if* 

2 active ingredient may be incorporated into an antiseptic wash containing sodium borate, glycerin 

20 and potassium bicarbonate. The active ingredient may also be dispersed in dentifrices, 
if) including: gels, pastes, powders and slurries. The active ingredient may be added in a 

therapeutically effective amount to a paste dentifrice that may include water, binders, abrasives, 

flavoring agents, foaming agents, and humectants. 

25 The compositions of the present invention may be formulated in a neutral or salt form. 

Pharmaceutically-acceptable salts include the acid addition salts (formed with the free amino 
groups of the protein) and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and 
the like. Salts formed with the free carboxyl groups also can be derived from inorganic bases 
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such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such 
organic bases as isopropylamine, trimethylamine, histidine, procaine and the like. 

Upon formulation, solutions will be administered in a manner compatible with the 
5 dosage formulation and in such amount as is therapeutically effective. The formulations are 
easily administered in a variety of dosage forms such as injectable solutions, drug release 
capsules and the like. For parenteral administration in an aqueous solution, for example, the 
solution should be suitably buffered if necessary and the liquid diluent first rendered isotonic 
with sufficient saline or glucose. These particular aqueous solutions are especially suitable for 
10 intravenous, intramuscular, subcutaneous and intraperitoneal administration. 

In this connection, sterile aqueous media which can be employed will be known to those 
of skill in the art in light of the present disclosure. For example, a unit dose could be dissolved 
W in 1 ml of isotonic NaCl solution and either added to 1000 ml of hypodermoclysis fluid or 

m 15 injected at the proposed site of infusion, (see for example, "Remington's Pharmaceutical 
[2 Sciences" 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will 

necessarily occur depending on the condition of the subject being treated. The person 
p responsible for administration will, in any event, determine the appropriate dose for the 

U individual subject. Moreover, for human administration, preparations should meet sterility, 

20 pyrogenicity, general safety and purity standards as required by FDA Office of Biologies 
*D standards. 

L. Examples 

The following examples are included to demonstrate preferred embodiments of the 
25 invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
practice of the invention, and thus can be considered to constitute preferred modes for its 
practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
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that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

EXAMPLE 1 
Materials And Methods 

Mice 

[SWR x C3H/HeJ] x C3H/HeJ F2 animals. SWR mice were obtained from Jackson 
Laboratories, and maintained in the Animal Resource Center of the University of TX, 
Southwestern Medical Center. C3H/HeJ mice were also obtained from Jackson Laboratories 
(Bar Harbor, ME). Fl mice obtained by the cross of C3H/HeJ males to SWR females were 
backcrossed at eight weeks of age to C3H/HeJ animals of both sexes. 493 F2 animals were used 
at six to eight weeks of age for analysis of LPS response phenotype and preparation of genomic 
DNA. In this panel of animals, both splenocyte responses and macrophage TNF production 
were assayed as endpoints of endotoxin response determination. 

[C57BL/6 x C3H/HeJJ x C3H/HeJ F2 animals. 1600 F2 mice were produced at the 
Jackson Laboratories, and shipped to the Animal Resource Center of the University of TX, 
Southwestern Medical center at 5 to 8 weeks of age. Animals were immediately ear-tagged for 
identification and tail cuttings were taken for identification. For this panel of animals, selection 
of individuals with recombination between markers D4MU164 and D4MIT27 was made by 
PCR analysis. All but a few non-recombinants were sacrificed; the remaining non-recombinants 
were used as controls (obligate Lps d heterozygotes or homozygotes with representative genetic 
variability at other loci) with each assay series. In this panel of animals, macrophage TNF 
production was used as the sole endpoint of endotoxin response determination. 

Preparation of genomic DNA 

Tail fragments approximately 5 mm in length were placed in 1.5 ml microcentrifuge 
tubes containing 650 |il of SSTE buffer, in which proteinase K had been dissolved at a 
concentration of 1 mg/ml. SSTE buffer was made by bringing 10 ml of 10% SDS, 2 ml of 5 M 
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NaCl, 5 ml of 1M tris, pH 8.0, and 3 ml of 0.5 M EDTA, pH 8.0 to a final volume of 100 ml 
with distilled water. The tissue was digested overnight at 55 °C. 30 ^1 of RNase Tl (5 U/|il) 
was then added to each tube, and a second incubation was carried out at 37 °C for one hour. 
DNA was then extracted with phenol, 1 : 1 phenol/chloroform solution, and finally chloroform. 
After addition of 1/10 volume of 3M sodium acetate solution, samples were precipitated with 
two volumes of ethanol. Pellets were dried and redissolved in 300 \il of TE solution. 

Assays for LPS responsiveness 

Splenocyte proliferation assays were used to type all 493 animals produced by the 
backcross of C3H/HeJ mice to SWR mice, and were performed by harvesting 30 to 40 million 
spleen cells from each animal. Erythrocytes were lysed with ammonium chloride, and the 
remaining cell population, predominantly lymphocytes, were plated at a density of 1 million 
cells per well, in triplicate, in round bottom 96-well plates. Cells were stimulated with Con A as 
a control for viability, and also with LPS at two concentrations (1.0 |ig/ml and 5.0 ng/ml) for a 
period of 72 hours, in the presence of tritiated thymidine. All samples showing Con A 
responses that exceeded 100,000 CPM were retained for analysis; those occasional samples that 
had smaller Con A responses were discarded from consideration. A cell harvester was used to 
collect the lymphocytes, and measurements of thymidine incorporation were made. An index of 
LPS responsiveness (I) was calculated for each sample as follows: 

I = [mean CPM of LPS-induced splenocytes]/[mean CPM of non-induced splenocytes] 

For simplicity, only the data pertaining to stimulation with 5 ng/ml LPS are reported in 
this paper. 

TNF production by peritoneal macrophages was used to type all 1600 animals 
produced by the backcross of C3H/HeJ mice to C57BL/6 mice. TNF mediated cytolytic activity 
is released by LPS-responsive cells following endotoxin stimulation in vitro, and was used to 
quantitate LPS sensitivity. Eight-week-old mice were injected intraperitoneally with sterile 
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Brewer's thioglycollate broth (3.0 ml). 4 days later, peritoneal exudate macrophages were 
harvested by peritoneal lavage. 2 x 10 5 cells from each mouse were plated in 24 well plates, and 
after adherence, stimulated by LPS at concentrations of 0.01 and 0.5 |ig/ml. After 16 hours, the 
medium was harvested and assayed for TNF content. TNF assays were performed using the 
L929 cell method, in which cells were plated in 96 well plates, exposed to the macrophage 
medium, and after rinsing, stained with crystal violet. The intensity of staining, measured in a 
microplate reader at 590 nm, corresponded to the fraction of surviving cells (Cseh and Beutler, 
1989). Results were expressed as % cytotoxicity, defined as: 

[1- OD[ sam plewell]/OD[control] ] * 100 

Both the splenocyte proliferation assay and the macrophage response assay permitted 
unambiguous discrimination between responders (heterozygotes for the Lps d allele) and 
nonresponders (homozygotes for the Lps d allele), but the macrophage response assay showed 
less day-to-day variability and was more convenient to perform. 

Contig Assembly. 

YACs were isolated from the Research Genetics mouse YAC library by PCR-based 
screening of a matrix consisting of superpools, pools, and plates. The probes initially employed 
in this process were the map-pair markers: D4MIT82, D4MIT218, D4MIT325, D4MIT244, 
D4MIT25, NdS9, D4MIT178, D4MIT7, D4MIT132, and D4MIT83. A total of 20YACs were 
isolated in this manner. All were sized by pulsed-field gel electrophoresis. The ends of all of 
the YACs were checked by sequencing. In some cases, continuity with adjoining BACs was 
immediately established. When highly repetitive sequence was present, internal sequences were 
determined by cloning rather large fragments of the YAC that contained the end. If when no 
overlap was established, additional YACs were isolated using the end sequence. In several 
instances, after two such cycles of chromosome walking, it was concluded that the original 
YACs were chimeric. 
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Unique sequences derived from shotgun cloning of YACs, as well as the above 
mentioned D4MIT markers, were used to identify BAC clones, chiefly by screening the 
Research Genetics mouse BAC library. Two additional clones (BACs 84C8 and 389F15) were 
obtained by screening BAC libraries maintained at Genome Systems (St. Louis, MO). A 
contiguous span of BACs crossing most of the interval was produced, as in the case of the YAC 
contig, by chromosome walking. Among the BACs that were isolated, only two proved to be 
chimeric; the chimeric BACs are not presented in any of the Figures. 

Novel marker identification. 

YACs were fragmented by ultrasonic shearing and subcloned for internal sequence 
analysis. Both random, non-repetitive sequences and novel microsatellite repeats were isolated 
in this manner. More than 100 such markers were identified across the region in question. 
These markers were used for PCR-based screening of a mouse BAC library from Research 
Genetics (Birmingham, AL), and permitted the isolation of BAC clones. The PCR primers 
defining all markers relevant to the Figures are presented in Table I. 



A: 22S917(4*BH01!.DOO 



-89- 



W3 

*s 

a 
u 

s 

u 



O 

z 
to 

HH 

a 

a 

1/3 

N — ✓ 

o 

SP 

o 

SP 

03 

o 



o3 

3 

o 

03 
03 

bx> 



QO 

6 
Z 

a 

HH 

o» 
w 

s 

o 
o 



03 
o 

•4— » 

too 

bX) 
bJQ 

03 

bD 
o3 

a 



o 
z 
a 

HH 

a 

to 

bX) 

o 
a 

o3 
o3 
03 

o 
o3 
bX) 
4— » 

o 
cd 

03 
c3 

3 



s 9 
z S 



to 

C/) 



C 

to 

c» 
^— ^ 

03 

2? 

bX) 
o3 

O 
c3 
O 
^— » 
c3 

bX) 



O 
o 
o3 
03 
U 

o 

•4— > 

u 

03 

o 

bX) 

■4— > 

o3 
03 
O 
bX) 
03 
03 



bX) bX> 
bX) o3 



1-H 

2 ° 
z z 

HH B 

w to 



^-t (N ^ CN — < 



O 

o3 
O 

O 
b0 

ts 

bo 

o 

S? 
o 
o 
o 



o 
z 
a 

C 

w 

bO 

So & 



03 
03 
03 



o3 
O 
bO 



o3 
O 



bX) 

■4— > 

bX) 



^H 

6 
z 

Q 

HH 

a 

bX) 
bO 



o3 



3 

03 



03 

bJO 

bo 



■4— 03 

03 



bX) 
O 
O 

■4—1 
•4—1 

bX) 



V© 
i— I 

6 
z 

Q 

HH 

a 

to 

CO 

bX) 
bl) 



T2 2 



03 
O 



w ^ C cd 
03 +-» -4-» 

-rt 



O 

o 
bX) 



03 



O 

z 

Q 

HH 

C 

W 

bX) 
o3 
o3 
O 
bD 
o3 
03 
bX) 

•4— » 
■4— » 

bX) 
bX) 
o3 
o3 
bX) 
bX) 



O 

z 

Q 

HH 

a 

to 

CO 

3 

bp 

O 

o3 
03 

bO 

4— » 
-4— > 
■4—1 

o3 
bO 

4— * 

o3 
O 

-4— » 

60 



O 

z 
a 

HH 

a 

to 

CO 

bD 
03 

o3 
03 

& 

03 

8P 

03 
o3 

SP 

o3 
O 
O 

bX) 



O O 

z z 

Q a 

HH " 

a o 

w w 

co $£, 



o 
o 
bX) 

03 
O 

u 
bX) 
bo 



o 
o 

bXj 
O 



03 
03 

o 
o 



o 

■4— » 

03 
o3 

■4— > 



z z 

a q 

" HH 

w §^ 



5£ 

o 
bX) 



03 
o 

bX) 

o 
o 



bX) 
03 

•4— > 

bx> 

•4— » 

bX> 
c3 

it 
& 

a 
o 

a 
o 



^ <ni ^ " <N 



O 
0\ 



IJ1 



12 

i 

c 

o3 
> 



bX) 

s 



(A 

IS 

a 

© 

E 

Ch 



E 

v> 

IS 
a 

u 
o 

E 
"5 

Cm 



to 
IO 

•-5 



> 

S 



Q 

Z 



Q 

Z 



a, 
U 



CO 

H 



c 

C3 
C 

.2? 

T5 
u 

C3 



Q 



CN 

S 

Q 



00 
(N 

Q 



oo 



Q 



in 

CO 



Q 



IT) 



CN 



ON 



1- 

6 
z 

Q 

w 

O 
bD 

cd 
cd 

•4— » 

bp 

bb 

■4-> 

bD 

•4-1 

bp 

bb 



O 
Z 
Q 



O 
Z 
Q 



00. & 



o 
cd 
bD 



cd 

3 



8 

cd 
bD 

cd 

g 



bD S 



cd 

•4— > 

bD 



6 
z 

Q 



J3 cd # 



O 

o 
cd 
cd 

-4— > 

cd 
o 
u 
cd 

gp 

cd 
o 



oe 

6 
z 

o 



o 



bD 
cd 
cd 
cd 
bD 



bD 

5 ^ 

"cd 

3> 

o 
cd 

9 

bD 
cd 



o s 

a 2 

o g 

CO © 

r w 



■4— » 

o 
u 
o 
bD 



6 
z 



cd 



o 

Cfl cd 

cd 

o 
u 



bD bD 
bD bD 



bD 

bD 
bD 



cd 
bD 
bD 
cd 
o 

o 
bD 



o 
bD 



bD 
cd 
o 



cd 
bD 
bD 



b£) 
cd 
cd 



0 
Z 



m 

6 
Z 
Q 



S <^ 

bJO 

bD Sd 



cd 
bD 



^ ^ bD 
C cd i-i 



2 a a g 

2 s a a £o g 

cd o *- *j hr 
cd 



cd 
cd 



bD 
cd 

SP 

bD 

•4—1 

bD 

SP 



S i 
o g 

to 



as 

as 

8 



cd 
bD 



o 
bD 



bp ^ 
cd cd 



cj 



bD bD 
ti bD 



o 

z 

Q 



6 
z 

a a 

co 



cd 
cj 
cd 

S 
cd 

o 

o 
cd 
cd 
cd 

% 
bD 
bD 



cd 

bD 
<*—» 

60 

cd 
bD 
cd 
o 

cd 
o 



6 
z 
a 



— | (N — (N (N (N — I (N 



W 

cd 
o 
bD 

■4— » 

o 
o 



cd 
o 

td 

cd 
bD 
cd 

-4-> 
td 

C^ 



O 

6 
z 



a a 

CO 



bD 
cd 
bp 

So 



bD 
cd 
cd 
cd 
o 
bD 



g 

a 
o 



s» 

cd 
cd 
bD 
bD 
cd 
bD 
cd 
bD 
bD 

as 



- N « M -I (\j 



Q 



Q 
Z 



P 

z 



Q 
Z 



Q 
Z 



00 



s 

a. 
© 
E 

*© 

Oh 



a 

u 

o 
E 

>* 

2 



6 

z, 

Q 

o» 

W 

CO 



6 
z 
a 



.is 
"3 
a 
i- 

s 



02 

IT) 

a 



c 

o 

•mm 

e 

(/} 
a> 

u 

C3 



em 
o 

3 

-4— » 

bo 

o 
"So 

•4— • 

o 

s? 

o 
oi 



b0 

a 
o 

■4—) 

o 
o 
bo 

bO 
■*-» 

gp 
gp 

bQ 



6 
z 

Q 

a 

w 

CO 

O 

a 

bJO 
» 

bJQ 

•4—* 
-1—* 

£P 

W> 
^— > 

bO 
CO) 
b0 



CO 
00 



On 



Shotgun sequencing. 

Using a minimal contig containing 24 BACs and one YAC, which together encompassed 
>95% of the interval between markers B and 83.3, approximately 20 million bases of high- 
quality sequence were obtained, from bidirectional reads of approximately 20,000 fragments of 
DNA (i.e., approximately 40,000 reads in all). In this process, all BACs and the YAC clone 
100E4 were fragmented by ultrasonic shearing, repaired to a blunt-ended state using Klenow 
fragment or mung bean nuclease, and subcloned into the vector pBluescript, which was been cut 
with Smal and treated with calf intestine alkaline phosphatase to prevent self ligation. The 
average fragment size was about 1.5 kb. Automated sequencing was performed using ABI model 
377A sequencers, and chain termination chemistry was used by all four of these sequencing 
laboratories. 

Bioinformatic analysis. 

All sequences were hand-edited or processed by Phred and Phrap (obtained from Brent 
Ewing and Phil Green, respectively; University of Washington Genome Center) to remove 
vector, and to assemble as completely as possible. Individual reads were stored as a growing 
database in a single large directory, and subjected to the following tests, all of which were carried 
out using Genetics Computing Group (GCG) software, or via web or e-mail servers: 

1. All sequences were periodically compared to all other sequences in the database using 
the Fasta search algorithm, in order to establish and display regions of overlap and homology. 

2. All sequences were masked to hide common repetitive elements from consideration 
prior to searches against the large sequence databases listed below. Censoring was usually 
carried out using the program RepeatMasker (obtained from Arian Smit, University of 
Washington). 
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3. All sequences, in censored and uncensored form, were subjected to blastn analysis 
using the genembl, dbest, and HCD databases, the latter at levels I and n. 

4. All sequences, in censored (masked) and uncensored form, were subjected to blastx 
5 analysis using the genembl and HCD databases, the latter at levels I and II. 

5. In some select instances, sequences were studied locally using the framesearch 
algorithm to detect open reading frames with homology to components of the Swissprot 
database. 

10 

6. All sequences were analyzed for rare peptide motifs by translating them in all six 
frames and subjecting them to a local Motif search. 

7. All sequences were, at various stages, aligned with others using the GCG program 
15 Gelmerge, or using the program Phrap. In this manner, long contiguous sequences were obtained 

and unambiguous overlap between adjacent BACs, as well as the approximate extent of overlap, 
could be inferred. 

8. Individual sequencing reads, or when possible, contigs of reads, were subjected to 
20 analysis by GRAIL 2 in order to identify putative exons. 

Exon trapping. 

A total of 169 exons were trapped from the "zero area" of the critical region.- Exon 
trapping was performed using the vector pSPL3 (Burn et a/., 1995). Digested genomic DNA was 
25 ligated into the vector, and pooled clones (generally 20 to 100 at one time) were transfected into 
COS-7 cells, obtained from the ATCC. Capture of putative exons was accomplished by PCR. 
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Shotgun sequencing. 

Using a minimal contig containing 24 BACs and one YAC, which together encompassed 
>95% of the interval between markers B and 83.3, approximately 20 million bases of high- 
5 quality sequence were obtained, from bidirectional reads of approximately 20,000 fragments of 
DNA (i.e., approximately 40,000 reads in all). In this process, all BACs and the YAC clone 
100E4 were fragmented by ultrasonic shearing, repaired to a blunt-ended state using Klenow 
fragment or mung bean nuclease, and subcloned into the vector pBluescript, which was been cut 
with Smal and treated with calf intestine alkaline phosphatase to prevent self ligation. The 
10 average fragment size was about 1.5 kb. Automated sequencing was performed using ABI model 
377A sequencers, and chain termination chemistry was used by all four of these sequencing 
laboratories. 

§7i Bioinformatic analysis. 

2? 15 All sequences were hand-edited or processed by Phred and Phrap (obtained from Brent 

*B Ewing and Phil Green, respectively; University of Washington Genome Center) to remove 

iff vector, and to assemble as completely as possible. Individual reads were stored as a growing 

jL database in a single large directory, and subjected to the following tests, all of which were carried 

%g out using Genetics Computing Group (GCG) software, or via web or e-mail servers: 

m 20 

^ 1. All sequences were periodically compared to all other sequences in the database using 

the Fasta search algorithm, in order to establish and display regions of overlap and homology. 

2. All sequences were masked to hide common repetitive elements from consideration 
25 prior to searches against the large sequence databases listed below. Censoring was usually 
carried out using the program RepeatMasker (obtained from Arian Smit, University of 
Washington). 
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3. All sequences, in censored and uncensored form, were subjected to blastn analysis 
using the genembl, dbest, and HCD databases, the latter at levels I and II. 

4. All sequences, in censored (masked) and uncensored form, were subjected to blastx 
5 analysis using the genembl and HCD databases, the latter at levels I and n. 

5. In some select instances, sequences were studied locally using the framesearch 
algorithm to detect open reading frames with homology to components of the Swissprot 
database. 

10 

6. All sequences were analyzed for rare peptide motifs by translating them in all six 
frames and subjecting them to a local Motif search. 

y 7. All sequences were, at various stages, aligned with others using the GCG program 

I~ 15 Gelmerge, or using the program Phrap. In this manner, long contiguous sequences were obtained 

^fl and unambiguous overlap between adjacent BACs, as well as the approximate extent of overlap, 

in could be inferred. 

b 

! ^ 8. Individual sequencing reads, or when possible, contigs of reads, were subjected to 

in 20 analysis by GRAIL 2 in order to identify putative exons. 

; 5 : 

Exon trapping. 

A total of 169 exons were trapped from the "zero area" of the critical region. Exon 
trapping was performed using the vector pSPL3 (Burn et al, 1995). Digested genomic DNA was 
25 ligated into the vector, and pooled clones (generally 20 to 100 at one time) were transfected into 
COS-7 cells, obtained from the ATCC. Capture of putative exons was accomplished by PCR. 
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Hybridization selection of expressed cDNAs using BACs. 

This method was carried out as described by Rommens et aL (1998), using cDNA from 
RAW 264.7 cells (obtained from the ATCC). A total of 538 selected clones from the "zero area" 
were examined in the course of the present study. 

Fluorescence in situ hybridization. 

Six separate distance measurements were made at Genome Systems, Inc., using 
interphase nuclei of mouse cells. The measurements included distances between the following 
BACs, which were used as probes: 

[L22 and 012], [012 and N13], [N13 and B6], [B6 and B22], [B22 and K4], [B22 and 

L15]. 

The minimum distance that may be measured through this approach is 0.2 Mb, and the 
maximum distance that may be measured is 1.6 Mb. A 10% margin of error is certified for all 
measurements. 

Genetic computation. 

A 500 Mhz DECa computer (DCG Viper EV-56) equipped with 63 Gb of hard drive 
space and 256 Mb of RAM was used in all of these studies. Programs were run under a Digital 
UNIX operating system. For the design of primer pairs to be used in PCR and sequence 
extension, the program Gene Runner was used (Hastings Software, Inc). The graphics program 
used in depiction of the contig was Hijaak Draw (Inset Graphics, Inc). Each of the latter two 
programs was run under Windows95 using a Pentium Pro computer from Gateway, Inc. 

EXAMPLE 2 
LPS Response Assays and Validation Thereof 

Both splenocyte proliferation assays and assays of TNF production were used to 
distinguish LPS nonresponder mice (Lps d homozygotes) LPS responders {Lps d heterozygotes) 
were used in analysis of the [SWR x C3H/HeJ]Fl x C3H/HeJ panel. Both assays were carried 

95 



A:2259I7(4*BHOI!.DOC) 



out using two separate LPS concentrations. When splenocyte proliferation was used as an 
endpoint, a simultaneous assay of proliferation occurring in response to concanavalin A was used 
as a means of controlling for cell viability. In the [SWR x C3H/HeJ]Fl x C3H/HeJ panel, 
conflict between the two assays was rarely observed, and mice that were felt to be ambiguous in 
5 their responsiveness to LPS were discarded from further consideration. 

Experience with the two assays of LPS response showed that measurement of TNF 
production occurring at low (10 ng/ml) concentrations of LPS was a more reliable index of 
responsiveness than measurement of splenocyte proliferation. Therefore, this assay was used 

10 exclusively in analysis of the [C57BL/6 x C3H/HeJ]Fl x C3H/HeJ backcross panel. Since not 
all of the potential recombinants were assayed for LPS responsiveness on the same day, and since 
day-to-day variability in TNF measurement might confuse interpretation of the results, a series of 
obligate Lps d allele homozygotes and heterozygotes were identified by analysis of flanking 
markers, and included as controls with each assay performed. The distinction between 

15 responder and non-responder control groups was in every instance clear and unambiguous. 
Similarly, the designation of each individual recombinant as homozygous or heterozygous with 
respect to the Lps locus was accomplished without ambiguity. 

EXAMPLE 3 

20 [SWR x C3H/HeJ] x C3H/HeJ Backcross 

In initial efforts to confine the Lps d mutation to a point between two markers, animals of 
the [SWR x C3H/HeJ] x C3H/HeJ backcross were examined. Raw data are not presented as this 
was essentially a range-finding study. A total of eleven D4MIT markers flanking the gene could 
be used for genotypic analysis, as the two parental genotypes could be distinguished on the basis 
25 of length differences. The crossover frequency between each of these markers and the Lps gene 
in a panel of 493 mice is illustrated in Figure 1. A cluster of four markers (D4MIT82, 325, 218, 
and 244) which underwent no recombination with one another each exhibited a single crossover 
event with the Lps mutation. This cluster of markers lay centromeric to the mutation. On the 
distal side of the mutation, D4MTT80 was separated from Lps d by 14 recombination events. 
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From the fact that only a single crossover event was observed between D4MIT82 and 
Lps d , it was inferred that the mutation might well lie within the proximal half of the D4MTT82 -» 
D4MIT80 interval. Partial ordering of microsatellite markers between D4MIT82 and D4MIT80 
5 was achieved by Bell, et al., who employed a series of deletion mutants to map markers in the 
region of the b locus (Bell et ai, 1995). A YAC clone containing D4MTT83 (which was mapped 
to the middle portion of the interval) was isolated, fragmented, and used to identify fresh 
microsatellite repeat polymorphisms. One of these, designated CA83.3, lay distal to the mutation 
and was separated from it by only six crossover events. Moreover a total of six polymorphic 
10 microsatellites (designated CA25.5, CA25.15, CA9.2, CA7.1, CA7.3, and CA7.11) were 
identified in three other YAC clones which were isolated using the markers D4MIT25, NdS9, 
and D4MIT7 . These six new markers all co-localized with the mutation (Figure 2). When 
physical mapping (detailed below) revealed that some of the new markers were separated from 
others of the group by as much as 1.6 Mb of DNA, it appeared certain that a second backcross 
;2 1 5 would be required to narrow the interval. 

m EXAMPLE 4 

U [C57BL/6 x C3H/HeJ] x C3H/HeJ Cross 

C 5 A total of 1600 F2 mice made by crossing C57BL/6 x C3H/HeJ and backcrossing to 

3?| 20 C3H/HeJ were analyzed for recombination in the region of the Lps gene. Those mice that 



•&2 



25 



showed such recombination were further analyzed for LPS responsiveness, and fine mapping of 
the recombination events was accomplished using novel microsatellite markers. The inventors 
examined markers over a relatively broad range in a screen for crossover events. This range 
spanned the interval D4MIT164 to D4MTT27, and included D4M1T178. 

A total of 192 crossovers were observed between D4MIT164 and D4MTT27 in the 1600 
meioses studied. 157 of these crossovers occurred between D4MIT178 and D4MIT27, while 35 
crossovers occurred between D4MIT164 and D4MIT178. The location of these events could be 
further assigned based on the identification of six novel polymorphisms within the D4MIT164 
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D4MIT27 interval (Figure 2). Five novel markers (designated A, D, E, 7.3 and 7.11) still co- 
localized with the Lps d mutation, as did marker D4MIT178. The physical separation between all 
four of these microsatellite markers (which were isolated from two overlapping BAC clones) 
was approximately 1.2 Mb, corresponding to the "zero region" of the critical area. Four 
5 crossovers were observed between Lps d and marker B. In terms of physical size, the B— > 83.3 
interval corresponds to approximately 2.6 Mb. Because the crossovers between B and 83.3 are 
essential to confinement of Lps d , the primary assay data for each of the seven critical meioses are 
presented in Figure 3. 

10 EXAMPLE 5 

The physical map 

A map of the entire interval between D4MIT325 and marker 83.3 is shown in Figure 4. 
A total of 63 BAC clones and 20YAC clones (only one of which is shown, in red) were isolated 
U to span the critical region. Each YAC and BAC clone is drawn to scale according to analysis by 

m 15 pulsed-field gel electrophoresis. A FISH map is drawn above the assembly, and is in 
'jf approximate agreement with the electrophoretic map. The black bar in Figure 4 indicates the 

m region of zero recombination determined on the basis of 493 meioses analyzed by the cross 

p [SWR x C3H/HeJ] x C3H/HeJ and 1600 meioses analyzed by the cross [C57BI76 x C3H/HeJ] x 

rt C3H/HeJ. This "zero area" is approximately 1.2 Mb in length. The critical region, known with 

111 20 certainty to contain the Lps d mutation, extends over a distance of approximately 2.6 Mb, 
-,n circumscribed by markers "B" and D4MIT83.3. 

EXAMPLE 6 
Gene identification 

25 

169 putative exons were identified throughout the "zero area" of the contig, and in 
selected portions that lay beyond the zero area, by exon trapping. 538 clones were isolated from 
the zero area and beyond by hybridization/selection. Some of each of these represented 
fragments of known genes and pseudogenes, including Pappa, HMG-I and KIAA0029 (detected 
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by both exon trapping and hybridization selection). TCP-1 and BTG-1 were detected by 
hybridization selection, though not by exon trapping. Most of the sequences identified by the 
two methods could not be shown to encode authentic, expressed genes despite exhaustive 
analysis through northern blotting and other techniques. 

High density sequence analysis was carried out over an interval containing 24 minimally 
overlapping BACs and one YAC clone. These BACs are depicted in Figure 5. BACs shown in 
blue were sequenced either to completion or to the point of finishing (i.e., only a few gaps remain 
between large assemblies of sequence). BACs shown in red were sequenced with to a high 
density (several hundred reads each) but not to the point of finishing, and are still considered "in 
progress." YAC 100E4 was analyzed with nearly 1000 reads. The genes and pseudogenes 
detected within the region by homology searches are indicated as well. 

A large number of exons were predicted by GRAIL analysis. However, many of these 
were of retroviral origin (e.g., exons within LI, Bl, Bam5, and IAP repeats). Among those that 
were not, some belonged to known genes such as Pappa. Among all GRAIL-predicted, non- 
repetitive exons with ratings of "good" to "excellent," none could be demonstrated to encode 
authentic macrophage-expressed transcripts when used as probes on northern blots. 

Pseudogenes encoding fragments of TCP-1, BTG-1, NKR-P12, KIAA0029, HMG-1, and 
cytochrome P450 were identified within the contig. However, all of these genes were either 
fragmentary or contained nonsense mutations. Only two authentic genes were identified within 
the contig. These were the classical marker gene Pappa, encoding a plasma metalloproteinase, 
and the mouse homolog of TLR-4, encoding a transmembrane protein homologous to the IL-1 
receptor. The TLR-4 gene resides in BAC clones 309117 and 152C16. Its sequence (5' 3') 
corresponds to a proximal distal orientation along the chromosome. The complete size and 
intron-exon boundaries of the gene remain to be determined. 
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EXAMPLE 7 

Identification of the Lps d Mutation that Alters the TLR-4 Protein Cytoplasmic 

Domain 



5 Mutations of the Lps gene selectively impede lipopolysaccharide (LPS) signal 

transduction in C3H/HeJ and C57BL/10ScCR mice. Homozygotes display a phenotype 
characterized by resistance to all biological effects of endotoxin and exaggerated susceptibility to 
overwhelming gram-negative infection. The codominant Lps d mutation of C3H/HeJ mice was 
confined to a 0.9 centiMorgan genetic interval, based on the analysis of 2093 meioses. 

10 

A minimal contig, consisting of 20 BAC clones and one YAC clone, was subjected to 
sequence analysis. Approximately 40,000 sequencing reads were obtained from shotgun-cloned 
O genomic DNA, bringing over 1.6 Mb of the central contig to a near-contiguous state and yielding 

L?j rather dense coverage of >95% of the entire critical region. BLAST searches performed on 

15 masked versions of the sequence disclosed dozens of high-scoring homologies with published 

us 

•S expressed sequence tags (ESTs), but these were excluded from consideration as they could not be 

iff cloned from macrophage or fetal cDNA libraries of trusted complexity. Several pseudogenes 

L were observed, but dismissed because they were found to be fragmentary. GRAIL analyses, 

TS=f 

■B performed on long contiguous sequences of the central contig using X-GRAIL software, revealed 

Iff 20 an abundance of retroviral repeats and scattered non-retro viral exons, most of which proved to be 

if? 

% z derived from pseudogenes. 

Two authentic genes (a portion of the Pappa locus and the entire TLR-4 receptor locus) 
were identified in the entire region, by both BLAST and GRAIL analysis. The orientation and 
25 location of these genes, with respect to the nearest genetic markers, is presented in Figure 6. 
Both genes lie at the centromeric end of the critical region, and at most, only a small 5' fragment 
of Pappa lies distal to marker B. 
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Pappa encodes a secreted metalloproteinase, which lacks any evidence of a 
transmembrane domain. It is expressed by placental tissue, but not by primary macrophages or 
macrophage cell lines (not shown), and for these reasons, as well as its extreme proximity to 
marker B (which is separated from Lps d by four crossovers in a panel of 1600 meioses), it is 
5 considered a poor candidate. 

In contrast, the TLR-4 locus seemed a particularly attractive candidate, both on the 
grounds of map position, and because the pro-inflammatory IL-1 receptor is also a member of the 
toll receptor family. Further, a human mutation causing co-resistance to LPS and IL-1 attests to 
10 the likelihood that the IL-1 and LPS signal transduction proteins share a common downstream 
intermediate. A priori, it would therefore seem likely that the DL-1 and LPS transducers are 
structurally related. Finally, the pro-inflammatory signaling potential of TLR-4 is suggested by 
Q studies in which chimeric versions of human TLR-4, bearing a CD4 sequence in place of the 

y native extracellular domain, were shown to be capable of activating NF-6B in human 

% 15 mononuclear cells. 

S Accordingly, the TLR-4 cDNAs from C3H/HeJ mRNA, and from the mRNA of several 

L LPS-responsive strains of mice (including C3H/HeN) were cloned, by reverse transcription and 

? B polymerase chain reaction, using primers derived from the genomic sequence. The amplified 

\n 20 product was fragmented by sonication, shotgun cloned into the vector pBluescript, and sequenced 

if: 

.2 using an Applied Biosystems model 373 DNA sequencer. 100 reads were aligned using the 

programs phred and phrap (obtained from Brent Ewing and Phil Green, University of 
Washington Genome Center). 

25 A single mutation was observed in the 835 aa coding region of the TLR-4 cDNA derived 

from C3H/HeJ mice. At position 712 (within the cytoplasmic domain), a histidine is predicted 
rather than the proline that resides in mouse TLR-4 from C3H/HeN mice, SWR mice, or 
C57BL/6 mice. Furthermore, the residue is invariant across species, in that endotoxin-responsive 
mice, rats and humans all display a proline in the relevant position (Figure 7a). The same 
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mutation was identified in DNA amplified from the C3H/HeJ genome and directly sequenced. It 
was not observed in DNA amplified from the C3H/HeN genome, nor in DNA from SWR, 
C57BL/6, 129, or DBA-2 mice (Figure 7b). No other difference in sequence has ever been 
observed on comparison of any genomic interval from C3H/HeN and C3H/HeJ mice, which 
were, until recently, a single strain. In Fig. 7A the sequence identified as J-toll is the mutant 
protein sequence in which the PRO at residue 712 is mutated to HIS. 

While the TLR-4 cDNA was readily amplified by RT-PCR from macrophage RNA 
derived from C3H/HeJ, C3H/HeN, and C57BL/10ScN mice, it could not be amplified from 
macrophage RNA derived from C57BL/10ScCR mice. On the other hand, several low- 
abundance control cDNAs could be amplified from all strains without difficulty (Figure 8a). 
Moreover, though the toll mRNA is relatively scarce, it could be detected on Northern blots 
prepared using total RNA derived from macrophages of the three former strains; it could not be 
detected in total RNA obtained from C57BI710ScCR mice (Figure 8b). Thus, a c/s-acting 
mutation- yet to be defined at the genomic level- prevents the expression of processed toll 
mRNA in C57BL/10ScCR mice. 

Since a definable mutation exists within TLR-4 in C3H/HeJ mice, and a severe deficiency 
or complete absence of TLR-4 mRNA expression is observed in C57BL/10ScCR mice, it is clear 
that Lps is, in fact, the TLR-4 gene. 

The Lps d mutation of C3H/HeJ mice was originally described as codominant, in the sense 
that Lps d /Lps n heterozygotes show intermediate levels of endotoxin response. The point 
mutation [ 732 (pro6 his)] that we have identified exerts a dominant negative effect on LPS signal 
transduction. Supporting this conclusion, the functionally null (i.e., non-expressed) Lps allele 
represented in C57BL/10ScCR mice is strictly recessive. Since the TLR-4 molecule functions in 
a dimeric state, it is likely that the Lps d mutation renders interplay between normal and abnormal 
subunits unproductive; hence the codominant phenotype. Alternatively, the mutation may cause 
sequestration of a downstream signaling molecule. 
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Insofar as the cross: 
C3H/HeJ X C57BI710ScCR 

5 

yields mice that are profoundly unresponsive to LPS, it may also be inferred that a single copy of 
the Lps d allele (i.e., the genotype Lps d /-) corresponds to a phenotype as unresponsive as that 
conferred by Lps d ILps d . This observation supports the notion that signal transduction proceeds 
directly through the TLR-4 molecule, and tends to detract from the hypothesis that TLR-4 
10 undergoes interaction with a second plasma membrane protein that acts, in turn, as a primary 
LPS transducer. 



Q TLR-4 mRNA is expressed predominantly in lymphoid tissues, and the TLR-4 mRNA is 

JtJ induced by LPS (Figure 9), suggesting that up-regulation of TLR-4 mRNA may perpetuate cell 

:2 15 activation by LPS. Moreover, the well-known phenomenon of endotoxin tolerance is not likely 

kQ to result from down-regulation of TLR-4. 

I~= 
12 i 

^ In Drosophila, several Toll homologs are now known to exist (including 18-wheeler. 

\Q Toll itself engages an extracellular protein ligand, spatzle, which promotes signal transduction 

j-fl 20 via tub and pelle gene products, leading to dissociation of the cytoplasmic inhibitor encoded by 
cactus from a yet-unidentified member of the rel family of transcription factors. This pathway 
culminates in activation of the drosomycin gene, and is required for effective protection against 
fungal infection. 

25 The mammalian ligand for TLR-4 has not yet been identified, and while an endogenous 

ligand may exist, it is equally possible that LPS recognition is the raison d'etre for TLR-4. 
CD14, the best characterized cell surface receptor for LPS, is also a member of the toll 
superfamily, and it is likely that it directly engages TLR-4 upon interaction with LPS, inducing 
signal transduction through the latter protein. 
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TLR-4 signal transduction is believed to proceed through activation of MyD88, a 
cytoplasmic protein bearing homology both to the cytoplasmic domain of toll itself and to a pair 
of death domain motifs, originally described in the TNF receptor family. In addition, the 
5 interleukin-1 receptor associated kinase (IRAK) and TNF-receptor associated factor (TRAF-6) 
lie on the TLR-4 signaling pathway, leading to the activation of AP-1 and NF-6B. It has been 
known for some time that NF-6B activation is essential for transcriptional activation of the 
mouse TNF gene, and that TNF subserves many of the effects of LPS, including the lethal effect. 

10 It has recently been reported that human toll-2 cDNA, transfected into 293 cells, can 

promote LPS signal transduction, given co-expression of CD 14. It was also noted that LPS is 
directly bound by soluble dimerized versions of toll-2, and that toll-2 mutants bearing truncations 
in the cytoplasmic domain exert a dominant inhibitory effect on LPS signaling. LPS signal 
transduction via TLR-4 has never been reported. However, the demonstration that Lps is 

15 identical to TLR-4 leaves no room for doubt that TLR-4 is essential for LPS signaling. In mice 
lacking functional TLR-4, toll-2 does not make a substantial contribution to LPS signal 
transduction; hence C3H/HeJ and C57BL710ScCR mice are entirely refractory to LPS. Though it 
is possible that toll-2, or other members of the toll family, might also be required for LPS 
signaling, the data in hand do not sustain this conclusion. Recently, it was demonstrated that 

20 Chinese hamsters lacking a fucntional copy of a toll-2 gene, still were responsive to endotoxin 
(Heine et aL, 1999), indicating that toll-2 is not essential in LPS signalling. 

Several mammalian toll homologs have now been identified, and several more may exist 
undiscovered. The phenotypic consequences of mutations in genes encoding other members of 
25 the family remain to be seen. Remarkably, C3H/HeJ and C57BL/10ScCR mice are 
developmentally and phenotypically intact, aside from their inability to effectively respond to 
LPS, and to gram-negative infection. Their response to products of gram-positive organisms and 
most other microbes is intact. This fact would suggest that TLR-4 has been retained in evolution 
principally for the purpose of serving the LPS response pathway. Malo and coworkers have 
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recently adduced evidence to suggest that, in birds, distinct allelic forms of Lps may influence 
survival during gram-negative infection. As such, polymorphisms associated with the tenascin 
locus predict outcome following infection of outbred chickens with Salmonella typhimurium. In 
mice, and presumably in birds, the tenascin locus is closely linked to Lps. It is entirely possible 
5 that mutations of the human TLR-4 gene also influence susceptibility to gram-negative infection, 
or its clinical outcome. As two independent mutations of TLR-4 have thus far become fixed in 
mice, it is likely that human populations have also retained TLR-4 mutations. A search for such 
mutations is currently in progress. 

10 EXAMPLE 8 

Genetic Variation at the TLR4 Locus 

y Materials and Methods 

kj Determination of the complete mouse (Tlr4) and human (TLR4) genomic sequences. 

15 The mouse BAC 152C16 (from the 129/J strain; Research Genetics), was earlier shown 

jfij by the inventors to contain the Tlr4 gene in entirety, and a small fraction of Tlr4 was also found 

« to reside in the overlapping BAC 309117 (Poltorak et aL, 1998). Human TLR4 was identified in 

O 

ifl BAC 11 OP 15 (Genome Systems) by hybridization screening. All three BACs were fragmented 

fil by ultrasound, shotgun cloned into the vector pBluescript-KS, and sequenced extensively using 

k n 20 ABI model 373 and 377 sequencers. 959 reads were obtained from 390117, 1503 reads from 
110P15, and 2731 reads from 152C16. The average read length was approximately 700 nt. To 
concentrate data acquisition efforts on the Tlr4 and TLR4 genes themselves, PCR primers were 
fashioned to match regions flanking each gene. A 16 kb fragment was amplified from the mouse 
BAC 152C16, and a 12 kb fragment was amplified from the human BAC 110P15, each 
25 containing all exons of the respective gene. These fragments were also shotgun cloned, and 
sequenced extensively, so that the depth of sequence reached an average of 12 reads over the area 
of greatest interest. Assembly was performed using the programs phred and Phrap (obtained 
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from Brent Ewing and Phil Green, University of Washington Genome Center). Interpretation of 
repetitive elements was achieved with the program RepeatMasker (obtained from Arian Smit, 
University of Washington Genome Center). A contiguous high-quality sequence 18974 bp in 
length containing TLR4 was obtained from the human BAC, and a contig 91748 bp in length 
containing Tlr4 was obtained from the mouse BAC. Over these intervals, the error rate was 
estimated at <1 per 10 4 bp. The sequences have been posted to Genbank in annotated form 
(accession number AF177767 for the murine sequence, SEQ ID NO:48, and accession number 
AF177765 for the human sequence, SEQ ID NO:47). All data related to mutations are presented 
with reference to these sequences. 

Sequencing DNAfrom individual human, mouse, chimpanzee, and baboon samples. 

Human DNA samples were obtained from blood anticoagulated with EDTA, obtained 
from healthy laboratory personnel and from non-selected visitors to the Aston Center outpatient 
clinic in Dallas, TX. The samples, 102 in all, were from a population of mixed ethnicity, but 
were predominantly obtained from Caucasian donors. Samples were prepared using the Wizard 
Genomic DNA Purification kit (Promega). Mouse DNA, obtained from animals of 35 Mus 
musculus strains, was ordered from the Jackson Laboratories. Chimpanzee and baboon DNA 
were obtained from Dr. Kurt Benirschke (U. of CA, San Diego) and Dr. Gregory Delzoppo 
(Scripps Research Institute), respectively. 

The three principal exons of Tlr4 and TLR4 were amplified independently from all 
human and mouse genomic DNA samples, leaving a margin of approximately 50 bp to each side 
of the exons so as to identify intronic mutations that might alter splicing. An alternative second 
exon, apparent in the cDNA sequence has been reported (Rock et aL, 1998) (Gb accession 
number U88880) that specifies a truncated and presumably inactive product. It was not analyzed 
in the population survey. 
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All exons of the chimpanzee were amplified and sequenced using the same primers used 
to amplify and sequence the human exons. For the baboon, the first two exons were also 
amplified using these same primers; however, the third exon of the baboon was amplified with a 
substituted primer at the 5' end. 

The PCR products were isolated by agarose gel electrophoresis. Exons 1 and 2 were 
sequenced using the same primers that were used for amplification. Exon 3 was sequenced using 
the flanking primers, as well as a collection of eight internal primers. In this manner, the entire 
coding region and all splice junctions of the human and mouse sequences could be covered with a 
total of 14 sequencing reads, given that all reads were of high quality. All primers used for 
amplification and sequencing are presented in Table II. 
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Independent assembly of each sample was required as a condition for further analysis, and 
if such assembly failed, additional reads were executed using a secondary collection of primers. 
Thereafter, mutations were identified en mass, by pooling all of the reads from 25 to 30 samples 
at a time and reassembling with the program polyphred, using the phredPhrapPoly script 
(obtained from Natalie Kolker, University of Washington Genome Center). Consed_alpha 
(obtained from David Gordon, University of Washington Genome Center) was used to visualize 
reads and mutations. 

The annotated chimpanzee exon sequences have been submitted to Genbank with the 
accession numbers AF179218, AF179219, and AF179220. The baboon sequences have been 
submitted with the accession numbers [pending]. For the purpose of genetic comparisons, rat 
and hamster Tlr4 sequences were also used; their Genbank accession numbers are AF057025 and 
AF153676, respectively. 

Genetic computation 

A 500 MHz DEC-alpha system equipped with 256 Mbytes of memory was used for direct 
analysis of sequence data as described above. In addition to the programs already mentioned, the 
GCG software (version 9.0) was used for alignment analysis. The windows-based program 
Generunner 3.0 (Hastings Software) was used for the design of oligonucleotide primers. A 
spline curve describing heterogeneity of the Tlr4 polypeptide sequence from different species 
was produced using the program Prism 3.0 (Graphpad Software, Inc). Sequences were prepared 
for submission with the use of the program Sequin 2.90 (obtained from NCBI). 

Results 

Overall structure ofTLR4 and Tlr4. 

The mouse Tlr4 gene is somewhat longer than its human counterpart, owing to the greater 
length of intronic sequence (15337 bp from beginning to end of transcribed sequence in the 
mouse, SEQ ID NO:48, as compared to 1 1467 bp in the human, SEQ ID NO:47). There are three 
exons in Tlr4, and each corresponds to a homologous sequence in the human gene. A human 
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cDNA sequence (Genbank accession number U88880) that includes a fourth exon, positioned 
between the "normal" first and second intron has been reported (Rock et al, 1998). When 
included in the processed transcript, however, this exon specifies early termination of the 
polypeptide chain. While it is possible that translation is initiated distal to the added stop codon, 
and that a shorter product results in the human than in the mouse, such a situation would be 
unusual given the length of the 5'UTR that would then exist, and the presence of multiple 
upstream initiation codons. Moreover, there is no murine sequence homologous to the 
alternative second exon of the human gene. The biological significance of this exon is therefore 
unclear, and in all likelihood, its inclusion in the mRNA leads to the formation of a 
nonfunctional protein product. 

Neither the human nor the mouse gene display a TATA element or CAAT box in the 
proximal promoter region. A number of conserved promoter and enhancer motifs are apparent 
on alignment of the murine and human 5 f flanking sequences. Both Tlr4 and TLR4 lie in the 
midst of repetitive sequences of retroviral origin, and no other genes may be detected in close 
communication with either of them, using homology searches or the gene prediction algorithm 
GRAIL. 

Genetic variation at the human TLR4 locus 

In total, 204 human TLR4 alleles were sequenced in entirety. As such, all but the rarest 
alleles have likely been encountered. According to the formula: 

(l-p) = (l-x) N , 

where p is the probability of detecting an allele, x is the actual frequency of the allele in 
population surveyed, and N is the number of alleles examined, there is a 95% chance that any 
allele with a frequency exceeding 1.46 % was detected through sequencing this population. 

In all, 12 mutable sites were found in human TLR4 (Table m, FIG. 10). Of these, most 
were confined to exon 3, though some were also placed in the second intron. Five changes 
observed in the coding region were silent (i.e., did not produce an amino acid substitution). Five 
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changes did cause amino acid substitutions, one of which affected the cytoplasmic domain, and 
four of which affected the ectodomain. 
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The most common allele in the human population exists at a frequency of 86.3%, 
and 74.5% of the population is homozygous for this allele. The most common TLR4 
polymorphism (designated TLR4-B; Genbank accession number AF 177766) exists at an 
5 allelic frequency of 6%, and consists of a double amino acid substitution (residues 299 
and 399 of the human polypeptide chain; nts 12874 and 13174 of the gene), one of which 
(residue 299) affects an aspartic acid residue conserved in mice, rats, hamsters, and 
chimpanzees (though differing in the baboon sequence). Though the mutations are 
almost always co-inherited, and hence, must lie in cis with one another, a single instance 
10 of mutation at the 12874 site was observed in the absence of the 13174 mutation. This 
would suggest that the double mutation may have arisen from an ancestral allele that is 
now very rare, and conceivably, it may confer an advantage to carriers. 

Of the other mutations observed, 3 modified relatively conserved amino acid 
15 residues, whereas 4 modified relatively variable residues. 

Genetic variation at the mouse Tlr4 locus. 

Among 35 strains of Mus musculus, 10 different alleles were identified, based on 
mutations occurring at 22 sites with respect to the reference sequence, 13 of which create 

20 amino acid substitutions (Table IV; FIG. 11). Hence, greater variation was observed 
among mice than among humans, with the most common murine allele represented at a 
frequency of only 69%. To a far greater extent in mice than in humans, the ancestry of 
different Tlr4 alleles may be traced, as many deviations from the reference allele occur in 
conjuction with one another. A plausible arrangement of strain relationships is presented 

25 in FIG 12. Some strains have accumulated many more mutations than others. For 
example, the P/J strain Tlr4 gene exhibits eleven mutations that distinguish it from the 
most common haplotype, six of them specifying changes in the Tlr4 amino acid sequence; 
the SEA/GnJ strain differs by nine mutations, and the strains NZW/J and VM/Dk, which 
are identical to one another, differ at six sites. Shared mutations suggest that 

30 interbreeding of some strains took place after their initial mutational separation had 
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occurred, leading to the introduction of groups of mutations by genetic recombination. 
Hence, mice of the P/J, NZW/J, and VM/Dk strains have several mutations that are 
observed in the A/J and BALB/c strains, but also lack some of the mutations of the latter 
strains, and have unique mutations of their own. 
1 LEGEND 

1. NZO/HILt 2. SI/Col 3. DBA/IJ 4. A/J 5. EL/Suz 6. CBA/J 7. AKR/J 8. BALB/Cj 9. 
Ddy/jCL 10. P/J 11. MRL/MpJ 12. SJL/J 13. NOD/LtJ 14. 129/J 15. FL/IRe 16. MA/MyJ 
17. SWR/J 18. LP/J 19. PRO/lReJ 20. SODl/Ei 21. SEA/GnJ 22. SM/J 23. KK/HJJ 24. 
ST/bJ 25. WB/Re 26. YBR/Ei 27. FVB/NJ 28. PL/J 29. LT/ChReSv 30. RIUS/J 31. RF/J 
32. NZB/B1NJ 33. AU/SsJ 34. NZW/LacJ 35. VM/Dk 



TABLE IV. 



POLYMORPHISM OF THE Tlr4 Among Mice 


Mouse 1 


Nucleotide 
(Genomic) 


Exon/ 
Intron 


AA 


Receptor 
Domain* 


Conserved 


10 


26400 A -» G 


Exon 2 








4,8,21,22 


37685: (T), 0 


Intron 2 








34,35 


37685: (T), 2 


Intron 2 








23 


37754 G A 


Exon 3 


94D-*N 


Ecto 


Yes 


4,8,10,20,2 
1,34,35 


38101 G -» A 


Exon 3 


209 M -M 


Ecto 


No 


21 


38130 A -»G 


Exon 3 


219D-»G 


Ecto 


No 


4,8,21 


38234 G -> A 


Exon 3 


254 V -» I 


Ecto 


Yes 


10 


38584 A G 


Exon 3 








21 


38742 A -» T 


Exon 3 


423 Q ^ L 


Ecto 


No 


10 


38794 G A 


Exon 3 








10 


38903 G T 


Exon 3 


477 A ^ S 


Ecto 


Yes 


18 


39020 A G 


Exon 3 


516T-»A 


Ecto 


No 


4,8,10,20,2 
1,22,34,35 


39199 C-»T 


Exon 3 
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4,8,10,20,2 
1,34,35 


39253 A -» C 


Exon 3 


593 E ^ D 


Ecto 


No 


23 


39273 A -» T 


Exon 3 


600 N-M 


Ecto 


No 


10 


39383 G A 


Exon 3 


637 V -> I 


TM 


Yes 


19 


39604 T-*C 


Exon 3 








4,8,20,21,3 
4,35 


39631 C-»T 


Exon 3 








4,8,20,21,3 
4,35 


39756 G -» A 


Exon 3 


761 R^H 


Cyto 


Yes 


18 


39826 T C 


Exon 3 








10 


39907 T ■* G 


Exon 3 


811 N^K 


Cyto 


No 
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As in the human TLR4 gene, most of the murine mutations reside within exon 3, 
and only two substitutions are noted to modify the cytoplasmic domain (FIG 12). Of 
these, however, one mutation (R761H), is fairly common among the strains surveyed, and 
corresponds exactly to the human mutation (R763H), observed in one individual out of 
the 102 surveyed. The same residue has been reported as an H in the hamster. A single 
conservative substitution (V637I) was noted within the transmembrane domain of the P/J 
strain. 

Anthropoid ape and lower primate TLR4 sequences, and their relationship to the human 
and rodent sequences 

The human and chimpanzee amino acid sequences are nearly identical over the 
interval studied, distinguished only by three amino acid substitutions. The baboon 
sequence is 93.5% identical to the human in the ectodomain, differs in the transmembrane 
domain by one substitution out of 30 residues, and differs in the proximal cytoplasmic 
domain by only 1 residue in 155. At the C-terminus, however, homology is badly 
disrupted, so that 16 of the last 21 human residues are not replicated in the baboon 
protein, which is 13 amino acids shorter than the human protein. Similarly, among 
rodents, the C-terminus of the protein is the least conserved. Overall, the order of 
conservation with respect to domain is: 

proximal cytoplasmic domain > transmembrane domain > ectodomain > distal 
cytoplasmic domain. 

Indeed, interspecific comparisons of complete Tlr4 amino acid sequence confirm 
the existence of a hypervariable region at the distal end of the Tlr4 cytoplasmic domain 
(Table V, FIG. 13). 
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TABLE V. 



Homology among functional domains of Tlr4 from six species. Percentages refer to 
identity on Fast-A comparison. Ecto, residues 1-631; TM, residues 632-662; Proximal, 
residues 663-819; Distal, residues 820-839. 



ECTO 





100% 














99.6% 


100% 












91.5% 


91.5% 


100% 










61.3% 


60.7% 


59.4% 


100% 








61.9% 


62.0% 


60.1% 


82.9% 


100% 






64.3% 


64.2% 


62.5% 


73.8% 


74.8% 


100% 


I 














TM 




100% 














100% 


100% 












97.1% 


97.1% 


100% 










67.7% 


67.7% 


67.7% 


100% 








70.6% 


70.6% 


70.6% 


91.2% 


100% 






73.5% 


73.5% 


73.5% 


79.4% 


79.4% 


100% 


I •■■ 


iHumanH 












PROXIM 






100% 














99.4% 


100% 












99.4% 


98.7% 


100% 










91.7% 


91.0% 


91.0% 


100% 








90.4% 


89.7% 


89.7% 


98.1% 


100% 






91.7% 


91.0% 


91.0% 


97.4% 


95.5% 


100% 

















DISTAL 





100% 














100% 


100% 












50% 


50% 


100% 










38.1% 


38.1% 


NS 


100% 








26.3% 


26.3% 


NS 


63.2% 


100% 






40.9% 


40.9% 


NS 


40.9% 


45.5% 


100% 




IHumanB 
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EXAMPLE 9 

TIr4-MEDIATED LPS SIGNAL TRANSDUCTION IN MACROPHAGES 



5 The mammalian LPS sensor is formed by a complex array of proteins, some of 

which may as yet be unknown. Complexes of LPS and LBP are initially engaged at the 
cell surface by CD 14, a glycosphosphoinositol-linked protein with no cytoplasmic 
domain (Wright et al. y 1990). The biological relevance of CD14 in LPS signaling, 
originally deduced from binding and transfection studies (Wright, 1990; Kirkland et ai, 
10 1990), is supported by gene knockout data, which revealed that CD 14 expression is 
required for normal sensitivity to LPS (Haziot et al, 1996. However, it was assumed that 
a second, membrane-spanning protein must ultimately transduce the LPS signal, since 
CD 14 lacks a cytoplasmic domain. 

15 The protein that likely fulfills this role was identified through studies of mice, 

ff\ wherein mutations of a single gene (Lps) have long been known to abolish LPS signal 

•2 transduction (Sultzer, 1968; Watson et ai, 1978). The Lps d allele, represented in 

'*i C3H/HeJ mice, is codominant in the sense that Lps d ILps n heterozygotes exhibit 

□ intermediate sensitivity to LPS (Rosenstreich et al, (1978), and their macrophages 

tfa 

^ 20 display intermediate levels of TNF production in response to LPS. The innominate non- 

61 responder allele represented in C57BL/10ScCr mice is recessive to the wild-type allele, in 

»jy that heterozygotes display normal responses to LPS (Coutinho and Meo, 1978). Using 

positional methods, the inventors have determined that Lps encodes the toll-like receptor 
4 (Tlr4), a single- spanning transmembrane protein with a leucine-rich ectodomain and a 
25 "Toll-like" cytoplasmic domain (Poltorak et ai, 1998a; Poltorak et aL, 1998b). The Lps d 
allele bears a missense mutation (2342 C_A; P712H) which lies within the cytoplasmic 
domain of the polypeptide chain. The mutation in C57BL/10ScCr mice is a null allele. 
Relying on these data (Qureshi et ai, 1999a), other workers confirmed the presence of the 
mutations (Qureshi et ai, 1999b). Moreover, Hoshino and colleagues demonstrated that 
30 a Tlr4 knockout produces an excellent phenocopy of the naturally occurring Tlr4 
mutations (Hoshino et aL 9 1999). 

123 
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In order to directly examine the role of Tlr4 as a transducer of the LPS signal in 
macrophages, the inventors expressed the normal mouse protein- and various mutant 
forms- in RAW 264.7 macrophages have been expressed (FIG. 14; Table VIII). These 
cells of murine origin are highly responsive to LPS, and are known to express the Tlr4 
mRNA (Poltorak et al 9 1998b). Unlike cell lines of non-myeloid origin (i.e., 293 cells or 
CHO cells), they express all proteins required for the elicitation of a biologically relevant 
response to LPS (e.g., TNF production). As such, they may be used to determine whether 
Tlr4 is a limiting factor in the initiation of an LPS signal, and to examine the mechanism 
by which the Tlr4 Lps ~ d allele exerts its dominant inhibitory effect on signaling. This was 
determined by performing more than 3,000 assays of TNF production, induced over an 
extensive range of LPS concentrations in numerous stable clones, since individual clones 
show considerable background variability in LPS responses. Moreover, because culture 
conditions can affect the sensitivity of the TNF assay and production of TNF by LPS- 
stimulated cells, all transfected and control clones were induced and assayed in parallel. 
Statistical analyses of shifts in the EC50 values determined for individual clones were 
then applied in order to determine the influence of recombinant protein expression. 

RAW 264.7 cells were first cotransfected with cDNAs derived from the Tlr4 Lps ' n 
and Tlr4 Lps d alleles and with a vector encoding neo. The amino terminus of each protein 
was flag-tagged to permit measurement of expression at the cell surface using the 
monoclonal antibody M2 (obtained from Sigma). After G418 selection, stable clones 
expressing each Tlr4 isoform or transfected with vector alone were examined for LPS 
signal transduction. The binding isotherm from one such clone, transfected with the 
Tlr4 l4)s n construct, is displayed in FIG. 15 A, and the flag copy number for clones bearing 
each construct is displayed in FIG. 15B. Despite the use of a strong promoter, the mean 
copy number rarely exceeded 3 x 10 4 per cell, and the range of expression among all 
clones spanned less than an order of magnitude. The relatively low copy number 
achieved is consistent with the possibility that surface expression may be limited by the 
level of co-expression of other proteins (e.g. MD-2 (Shimazu et al. y 1999)), and the lower 
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copy number in clones bearing truncated constructs as compared with full- length 
constructs may reflect diminished stability. 

Composite EC50 analysis revealed strong augmentation of the LPS response (a 
30-fold leftward shift of the curve) resulting from modest over-expression of the normal 
protein; even stronger suppression (a 2600-fold shift to the right) was observed with 
expression of the mutant isoform. Hence, with respect to the over-expression of Tlr^ 5 "" 
and Tlr4 Lps d proteins, a 74,000-fold difference in the mean response is apparent at the 
EC50 point (FIG. 15C). A more conservative approach, based on measurement of the 
LPS EC50 for each individual clone, also revealed that the Tlr4 Lps ' n isoform strongly 
enhances LPS sensitivity, while the Tlr4 Lps d isoform strongly suppresses it (FIG. 15D). 
According to this method of estimation, the composite range of the responsiveness (mean 
EC50 of Tlrt 1 ^ transfected clones/mean EC50 of Tlri 1 * 8- " transfected clones) was 1 120- 
fold. Taking the dimension of receptor number into account (FIG. 15E), and excluding 
the statistical contribution of the clones transfected with vector alone, no significant 
correlation was observed between LPS signal intensity and the absolute number of 
recombinant Tlr4 molecules on the cell surface. This suggests that the number of 
artificial receptors expressed (usually -1-2 x 10 4 ) is well in excess of the number of 
native receptors, and that maximum augmentation of LPS response is achieved in each 
clone transfected with Tlr4 Lps ' n , with variation attributable to other factors. It is 
interesting in this regard that, on a linear scale, the inhibitory effect of Tlr4 Lps-d over- 
expression vastly exceeds the augmenting effect of Tlr4 Lps * n expression. In fact, most of 
the macrophage clones were rendered virtually unresponsive to LPS through over- 
expression of Tlr4 Lps " d (LPS EC50 > 10 ^g/ml). 

Several conclusions can be drawn from these results. First, the dominant effect 
of the TM 14 ' 5 ^ allele may be directly demonstrated through transfection-based expression 
of the protein at moderate levels in an LPS-responsive macrophage cell line. Second, 
since over-expression of Tlr4 Lps n augments the LPS response, the intensity of the LPS 
signal is normally limited by the quantity of Tlr4 protein on the macrophage membrane. 
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# # 

Third, since there appears to be little correlation between the absolute number of 
recombinant receptors expressed and the magnitude of augmentation or inhibition 
achieved, it may be inferred that saturation of another component of the signaling 
cascade, either proximal or distal to Tlr4, occurs when the level of Tlr4 expression 
5 exceeds a certain threshold: perhaps in the range of several thousand copies per cell. By 
the same token, the level of endogenous Tlr4 expression is probably very low: perhaps 
lower than 10 (Michalek et ai, 1980) copies per cell (a finding consistent with the fact 
that Tlr4 mRNA is of very low abundance (Poltorak et al, 1998b), but nonetheless 
remarkable in view of the global effects wrought by activation of the receptor). Fourth, 
10 and perhaps most important of all, since rather modest changes in the level of normal Tlr4 
protein expression or the expression of a dominant negative Tlr4 isoform can shift LPS 
sensitivity over a range that spans three to four orders of magnitude, one may infer that 
□ Tlr4 is of preponderant importance in LPS signal transduction: there is little room for the 

i7= belief that independent pathways act to transduce the LPS signal as well. This last point 

jff 15 is fully supported by the observation that mutation or deletion of the Tlr4 locus can 

\Q completely abrogate LPS signaling (Sultzer, 1968; Coutinho and Meo, 1978; Poltorak et 

m 

m ai, 1998b). 

=S In one model, the dominant suppressive effect of Tlr4 Lps ' d might be ascribed to the 

jiff 20 postulated multimeric structure of the Tlr4 protein (Schneider et al. 9 1991; Medzhitov et 

al, 1997), given that unproductive association between normal and abnormal subunits 
yields inhibition of signaling. If association between subunits is principally dependent 
upon ectodomain contacts, one would predict that any mutation that disrupts function of 
the Tlr4 cytoplasmic domain might impede signal transduction in a dominant fashion, just 
25 as observed with Tlr4 Lps " d . To examine this hypothesis, the inventors expressed a 
truncated version of the Tlr4 protein, lacking the entire cytoplasmic domain. This protein 
was well expressed on the cell surface, but had only a weak inhibitory effect on LPS 
signaling, which fell short of significance according to the more stringent method of 
analysis (FIG. 16A and FIG. 16B). Hence, the Tlr4 Lps * d isoform exerts a strong dominant 
30 effect whereas deletion of the entire cytoplasmic domain does so weakly at most. The 
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inventors conclude that, if the model of dominant inhibition based on multimeric 
structure is correct, cytoplasmic domain contacts, in addition to ectodomain contacts, 
must serve to maintain the holoprotein in a multimeric state. Given free exchange 
between subunits, mutant Tlr4 proteins lacking a cytoplasmic domain would predictably 
be excluded from the signaling complex in favor of intact subunits that interact more 
strongly with one another. 

The biological consequence of interactions between normal and mutant 
cytoplasmic domains and the intact Tlr4 protein were tested directly. Upon expressing 
membrane-associated versions of the Tlr4 cytoplasmic domain (with either the normal or 
mutant sequence and an exteriorized flag peptide, but lacking the entire ectodomain), the 
inventors observed that the all-cytoplasmic Tlr4 Lps " d isoform inhibited LPS signaling 
(rightward EC50 shift of 39-fold; p=0.0019). The TW 4 " isoform was a significantly 
weaker inhibitor than the Tlr4 Lps ' d isoform (p=0.0214), though it also blocked signaling 
when overexpressed (rightward EC50 shift of 4-fold; p=0.035). These results suggest 
that the Tlr4 Lps d mutation not only permits interaction between normal and mutant 
subunits, but actually enhances the interaction, or actively engages downstream signaling 
molecules, thus blocking the LPS response (FIG. 17A and FIG. 17B). 

The primacy of Tlr4 in LPS signal transduction was originally suggested by 
positional cloning data (Poltorak et ai 9 1998a; Poltorak et al. y 1998b). Direct 
confirmation of Tlr4's function as the transmembrane signaling component of the LPS 
receptor complex has lagged, due largely to the difficulties inherent in transiently 
transfecting macrophage lines with high efficiency. Rather, there has been a tendency to 
utilize non-macrophage lines (e.g., HEK 293 cells) in transfection-based studies of LPS 
signaling. However, these cells do not replicate the biological response phenotype of 
authentic macrophages. In particular, they do not produce TNF in response to LPS; hence, 
a surrogate endpoint of response (nuclear translocation of NF-B, measured using a 
transcriptional reporter) has been adopted in most such studies. However, it is quite clear 
that NF-B translocation cannot be equated with an LPS response, since many stimuli that 
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elicit NF-B Translocation yield effects that have little or nothing in common with the LPS 
response in other respects. Moreover, it has recently been shown that mutational 
inactivation of MyD88, which is known to engage Tlr4, leads to a state of profound LPS 
unresponsiveness, though permitting NF-B Translocation. Finally, given that a response 
of any kind is observed in non-macrophage cell lines, there exists no standard for 
comparison. It has never been clear, for example, that the magnitude of the NF-B 
response approaches that witnessed in a normal macrophage over an identical range of 
LPS concentrations, nor is it known what effect this might have in a macrophage. These 
technical issues have, to date, confounded interpretation of which molecule actually does 
transduce the LPS signal, a role previously ascribed to Tlr2 (Yang et al, 1998; 
Kirschning etai, 1998) but now clearly attributable to Tlr4. 

The present data reveal that Tlr4 is the limiting factor in LPS signal transduction 
in LPS responsive macrophages. Over-expression of Tlr4 in cells that already express it 
augments the LPS response, by about 30-fold on average. The relationship between the 
level of Tlr4 expression and biological response indicates that, although other proteins 
fulfill indispensable functions in LPS signal transduction both upstream (Wright et ai, 
1990) and downstream (Kawai et ai, 1999; Muzio et ai, 1998; Medzhitov et ai, 1998) 
from Tlr4, the quantity of Tlr4 expressed is an important limiting factor in the intensity of 
the signal that is evoked. Hence, sensitivity to LPS is likely controlled through 
modulation of Tlr4 biosynthesis or activity. Priming by interferon (Pace et ai, 1985; Lau 
and Livesey, 1989; Hayes and Zoon, 1993) or by treatment with facultative intracellular 
pathogens (Vogel et ai, 1980; Haranaka et ai, 1984; Matsuura and Galanos, 1990) can 
greatly enhance sensitivity to LPS, while corticosteroids create a state of LPS resistance 
(Beutler et ai, 1986). Such modulation may be achieved through alteration of Tlr4 
structure or expression, or alternatively, through changes in sensitivity to the signal that 
Tlr4 initiates, or changes in the intensity of the signal that Tlr4 receives. 

The over-expression of a membrane-anchored Tlr4 ectodomain (lacking any of the 
wild-type cytoplasmic domain) inhibited the LPS response only weakly, if at all. This 
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failure of the overexpressed ectodomain to block signaling by a competitive mechanism 
implies that upstream components of the signal transduction pathway must either exist in 
excess with respect to Tlr4, or must interact with Tlr4 at very low affinity. There is now 
good reason to doubt the proposal (Modlin et ai, 1999) that the expression of soluble 
5 Tlr4 might prove an effective means of interdicting the LPS signal in vivo, particularly in 
view of the fact that a membrane-anchored form of the protein would be sterically 
positioned to exert such an effect with maximum efficiency, while a soluble form would 
not be. It is, for example, possible to calculate the local concentration of Tlr4 ectodomain 
achieved through over-expression of a membrane-associated version of the protein. 

10 Assuming that the Tlr4 ectodomain resides within a space that is 100 A "deep" from its 
most apical point to the surface of the membrane, and further assuming that the 
macrophage is a spherical body with a 15 jiM radius, the expression of 2 x 10 4 receptors 
per cell corresponds to a protein concentration of 1.2 x 10" 6 M, or 840 |ig of ectodomain 
per ml. While it might be possible to achieve such concentrations of soluble ectodomain 

15 in vivo, it would not be easy to do so, and at that, little or no attenuation of the LPS signal 
would be anticipated. On the other hand, interventions that inhibit contact between Tlr4 
subunits would be likely to have a pronounced impact on signal transduction. 



TABLE Vm 



Construct 


Residues expressed 


Copies/cell 


Asd 


n 


Tlr4 Lpsn 


22 - 835 


2.1 x 10 4 


4.1 x 10 3 


10 


Tlr4 H>s-d 


22-835 (P712H) 


2.0 x 10 4 


1.4 x 10 4 


10 


Thd*** Cyt. Dom. 


630 - 835 


1.2 x 10* 


3.8 x 10 3 


7 


TUrf** Cyt. Dom. 


630 - 835 
(P712H) 


9.9 x 10 3 


4.0 x 10 3 


8 


Ectodomain 


22 - 660 


1.4 x 10 4 


5.1 x 10 3 


9 



20 
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All of the compositions and methods disclosed and claimed herein can be made 
and executed without undue experimentation in light of the present disclosure. While the 
compositions and methods of this invention have been described in terms of preferred 
embodiments, it will be apparent to those of skill in the art that variations may be applied 
to the compositions and methods and in the steps or in the sequence of steps of the 
method described herein without departing from the concept, spirit and scope of the 
invention. More specifically, it will be apparent that certain agents which are both 
chemically and physiologically related may be substituted for the agents described herein 
while the same or similar results would be achieved. All such similar substitutes and 
modifications apparent to those skilled in the art are deemed to be within the spirit, scope 
and concept of the invention as defined by the appended claims. 
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